REFERENCE: Derwent DGene Search Report 

FRANZ-BACON, et al., USSN: 09/099,898 
Atty. Docket No.: DX0744K 



Mouse C2 



MKTTTCSLLICISLLQLMVPVNTDETIEIIVENKVKELLANPANYPSTVTKTLSCTSVKTMNRWASCPAGMTATGCAC 
GFACGSWEIQSGDTCNCLCLLVDWTTARCCQLS 
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L2 ANSWER 1 OF 
ACCESSION NUMBER: 
TITLE : 

INVENTOR: 

PATENT ASSIGNEE: 

(UYSC-N) 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 
PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
DESCRIPTION: 

KEYWORD: 



9 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
96P-R98208 Protein DGENE 

Cell-targetted retroviral vector particles - having 
envelope protein modified with targetting polypeptide 
Anderson W; Chiang Y L; Januszeski M; Mackrell A J; 
Zhao Y 

(GENE-N) GENETIC THERAPY INC 
UNIV SOUTHERN CALIFORNIA 

WO 9630504 Al 961003 - 73 pp 

WO 96-US3908 960322 
US 95-409648 950324 
Example 2; Page 36 
30 DEC 1996 (first entry) 
Patent 
English 
96-455352 [45] 

Nucleotide used in production of MSH/MoMuLV chimeric 
sequence 

Moloney murine leukaemia virus; gp70; 4070A retrovirus; 
retrovirus; 10A1 murine leukaemia virus; NZB-9-1 murine 
leukaemia virus; poly tropic MX27 provirus; targetted 
drug delivery; gene therapy; single chain antibody; 
envelope protein; ss 
ORGANISM: Synthetic 
ABSTRACT : 

Cell targetted retroviral vector particles can be used in gene 
therapy to deliver a heterologous gene to a target cell for 
expression of a heterologous polypeptide in that cell. The cell 
targetted retroviral vector particles comprise an envelope protein 
which is modified to contain a targetting polypeptide (a single 
chain antibody) , or in the case of moloney murine leukaemia virus 
(MoMuLV) , alpha melanotropin-stimulating hormone (MSH). Two 
oligonucleotides (R98207, R98208) were used to substitute sequences 
in MoMuLV for MSH sequences. This oligonucleotide was used to 
replace residues G80-P88 of MoMuLV envelope protein (See W04248) 
AMINO ACID COUNTS: 8 A; 0 R; 0 N; 0 D; 0 B; 17 C; 0 Q; 0 E; 0 Z; 8 

0 H; 0 I; 0 L; 0 K; 0 M; 0 F; 0 P; 0 S; 11 
0 W; 0 Y; 0 V; 

SEQUENCE LENGTH: 
SEQUENCE 

1 catttccgat ggtgcaagcc ggtattaacc tccctcaccc ctcg 
ALIGN Smith-Waterman score: 86 
43 aa overlap starting at 5 

scpagmtatgcacgfacgsweiqsgdtcnclcllvdwttarcc 



G; 
T; 
44 



tccgatggtgcaag_ccgg_tattaacctccc_ 



tcaccc 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 
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L2 ANSWER 2 OF 9 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 98P-W56120 protein DGENE 

TITLE: Reducing severity of host versus graft reaction - 

comprises suppressing auto- immunity to heat-shock 
protein to prevent rejection 

INVENTOR: Birk O; Cohen I R 

PATENT ASSIGNEE: (YEDA)YEDA RES & DEV CO LTD 

PATENT INFO: WO 9808536 A2 980305 37 pp 

APPLICATION INFO: WO 97-US15294 970902 - 

PRIORITY INFO: US 96-706209 960830 

PAT. SEQ. LOC: Disclosure; Pages 28-29 

DATA ENTRY DATE: 08 JUL 1998 (first entry) 

DOCUMENT TYPE: Patent 

LANGUAGE: English 

OTHER SOURCE: 98-179175 [16] 

DESCRIPTION: Protein sequence of human heat shock protein (hsp) 60 

KEYWORD: Heat shock protein; hsp60; human; host versus graft 

reaction; HVGR; transplantation; organ; tissue; 

downregulatation; autoimmunity; prevention; graft 

rejection 
ORGANISM: Homo sapiens 

ABSTRACT : 

The present sequence represents human heat shock protein 60 
(hsp60) . This protein is a stress protein expressible in all cells 
of the body. The severity of a host versus graft reaction (HVGR) 
concominant with transplantation of donor organ or tissue, can be 
reduced by downregulating hsp60 autoimmunity in the host. The 
specification also describes a method for selecting peptides for 
preventing or suppressing graft rejection. This method comprises 
treating a panel of labelled peptides with antigen-presenting cells 
isolated from peripheral blood lymphocytes of the candidate host, 
and selecting those that bind with the antigen presenting cell. 
hsp60, or its peptides, analogues, salts and functional derivatives 
can be used for downregulating hsp60 autoimmunity especially for 
reducing HVGR. hsp60 autoimmunity can accelerate foreign immunity 
and its downregulation helps prevent graft rejection 

AMINO ACID COUNTS: 57 A; 22 R; 20 N; 37 D; 0 B; 3 C; 17 Q; 43 E; 0 Z; 

57 G; 3 H; 43 I; 48 L; 53 K; 18 M; 10 F; 19 P; 24 S; 
34 T; 1 W; 7 Y; 57 V; 

SEQUENCE LENGTH: 573 

SEQUENCE 

1 mlrlptvfrq mrpvsrvlap hltrayakdv kfgadaralm lqgvdllada 
51 vavtmgpkgr tviieqgwgs pkvtkdgvtv aksidlkdky knigaklvqd 
101 vanntneeag dgtttatvla rsiakegfek iskganpvei rrgvmlavda 
151 viaelkkqsk pvttpeeiaq vatisangdk eigniisdam kkvgrkgvit 
201 vkdgktlnde leiiegmkfd rgyispyfin tskgqkcefq dayvllsekk 
251 issiqsivpa leianahrkp lviiaedvdg ealstlvlnr lkvglqwav 
301 kapgfgdnrk nqlkdmaiat ggavfgeegl tlnledvqph dlgkvgeviv 
351 tkddamllkg kgdkaqiekr iqeiieqldv ttseyekekl nerlaklsdg 
401 vavlkvggts dvevnekkdr vtdalnatra aveegivlgg gcallrcipa 
451 ldsltpaned qkigieiikr tlkipamtia knagvegsli vekimqssse 
501 vgydamagdf vnmvekgiid ptkwrtall daagvasllt taevwteip 
551 keekdpgmga mggmgggmgg gmf 
ALIGN Smith-Waterman score: 81 

38 aa overlap starting at .442 
csllicisl lqlmvpvntdet ie i i venkvke 1 lanpa 



callrcipaldsltpanedq kigiei_ikrtlkipa 
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L2 ANSWER 3 OF 9 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 97P-W12345 protein DGENE 

TITLE: New peptide (s) derived from human heat-shock protein 60 

- used for early diagnosis, prevention and treatment of 
insulin-dependent diabetes mellitus 

INVENTOR: Abulafia R; Bockova J; Cohen I R; Elias D 

PATENT ASSIGNEE: (YEDA)YEDA RES & DEV CO LTD 

PATENT INFO: WO 9701959 Al 970123 49 pp 

APPLICATION INFO: WO 96-US11375 960701 - 

PRIORITY INFO: IL 95-114407 950630 

PAT. SEQ. LOC: Disclosure; Page 28-29 

DATA ENTRY DATE: ' 13 NOV 1997 (first entry) 

DOCUMENT TYPE: Patent 

LANGUAGE: English 

OTHER SOURCE: 97-108693 [10] 

DESCRIPTION: Human heat-shock protein 60 

KEYWORD: Heat-shock protein; hsp; hsp60; insulin-dependent 

diabetes mellitus; IDDM 
ORGANISM: Homo sapiens 

ABSTRACT : 

The peptides given in W12346 to W12358 are derived from human hsp60 
(W12345) and are useful for early diagnosis of IDDM by detecting, 
in the blood or urine, antibodies or T-cells immunologically 
reactive with human hsp60 (presence of which indicates high 
probability of diabetes or its subsequent development) . Other 
peptides (W12359 to W12361) were shown not to be as effective 

AMINO ACID COUNTS: 57 A; 22 R; 20 N; 37 D; 0 B; 3 C; 17 Q; 43 E; 0 Z; 

57 G; 3 H; 43 I; 48 L; 53 K; 18 M; 10 F; 19 P; 24 S; 
34 T; 1 W; 7 Y; 57 V; 

SEQUENCE LENGTH: 573 

SEQUENCE 

1 mlrlptvfrq mrpvsrvlap hltrayakdv kfgadaralm lqgvdllada 
51 vavtmgpkgr tviieqgwgs pkvtkdgvtv aksidlkdky knigaklvqd 
101 vanntneeag dgtttatvla rsiakegfek iskganpvei rrgvmlavda 
151 viaelkkqsk pvttpeeiaq vatisangdk eigniisdam kkvgrkgvit 
201 vkdgktlnde leiiegmkfd rgyispyfin tskgqkcefq dayvllsekk 
251 issiqsivpa leianahrkp lviiaedvdg ealstlvlnr lkvglqwav 
301 kapgfgdnrk nqlkdmaiat ggavfgeegl tlnledvqph dlgkvgeviv 
351 tkddamllkg kgdkaqiekr iqeiieqldv ttseyekekl nerlaklsdg 
401 vavlkvggts dvevnekkdr vtdalnatra aveegivlgg gcallrcipa 
451 ldsltpaned qkigieiikr tlkipamtia knagvegsli vekimqssse 
501 vgydamagdf vnmvekgiid ptkwrtall daagvasllt taevwteip 
551 keekdpgmga mggmgggmgg gmf 
ALIGN Smith-Waterman score: 81 

38 aa overlap starting at 442 
csllicisllqlmvpvntdetieiivenkvkellanpa 



callrcipaldsltpanedq kigiei_ikrtlkipa 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 
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L2 ANSWER 4 OF 
ACCESSION NUMBER: 
TITLE : 



INVENTOR : 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 



PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
CROSS REFERENCES: 
DESCRIPTION: 
KEYWORD: 



9 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
97P-W01657 Protein DGENE 

Compsns. for treating or preventing insulin-dependent 
diabetes - based on T cells specific for 65 kD 
heat-shock protein 
Cohen I R; Elias D; Markovits D 
(YEDA)YEDA RES & DEV CO LTD 

961126 29 pp 

890314 - 
910829 
890314 
890626 
900314 
931112 
3A-3B 



US 
US 
US 
US 



557830-3 A 
89-322864 
91-751448 
89-322864 



US 89-371249 
US 90-493127 
US 93-151052 
Disclosure; 



Fig 



25 APR 1997 (first entry) 
Patent 
English 
97-020369 [02] 
N-PSDB: 97N-T58403 
Human heat shock protein 65 

Heat shock protein 65; Hsp65; insulin-dependent 
diabetes mellitus; IDDM; autoimmune disease; diagnosis; 
therapy; T cell; vaccine 
ORGANISM: Homo sapiens 

ABSTRACT: 

The human heat shock protein 65 (Hsp65) (W01657) is expressed in 
the islets of the pancreas. The T cell response to Hsp65 is 
associated with the development of insulin-dependent diabetes 
mellitus (IDDM) . A method for detecting the existence of, a 
tendency to develop, or the initiation of a process leading to IDDM 
involves detecting the presence of Hsp65 or antibodies or T cells 
reactive with the protein. Hsp65, when administered to a 
tolerogenic carrier, can be used to prevent or treat IDDM prior to 
development of clinical symptoms. Attenuated T cells can be used 
to vaccinate against autoimmunity to Hsp65 and to abort IDDM 
AMINO ACID COUNTS: 56 A; 22 R; 21 N; 35 D; 0 B; 3 C; 18 Q; 40 E; 0 Z; 

58 G; 2 H; 41 I; 49 L; 54 K; 18 M; 11 F; 19 P; 24 S; 

60 V; 



34 T; 
573 



1 W; 



7 Y; 

SEQUENCE LENGTH: 
SEQUENCE 

1 mlrlptvfrq mrpvsrvlap hltrayakdv kfgadaralm lqgvdllada 
51 vavtmgpkgr tviieqswgs pkvtkdgvtv aksidlkdky knigaklvqd 
101 vanntnegag dgtttatvla rsiakegfek iskganpvei rrgvmlavda 
151 viaetkkqsk pvttpeeiaq vatisangdk eigniisdam kkvgrkgvit 
201 vkdgktlnde leiiegmkfd rgyispyfin tskgqkcefq dayvllsekk 
251 issiqsivpa leianlvlnr lkvglqwav kapgflvlnr lkvglqwav 
301 kapgfgdnrk nqlkdmaiat ggavfgeegl tlnledvqph dlgkvgeviv 
351 tkddamllkg kgdkaqiekr iqeiieqldv ttseyekekl nerlaklsdg 
401 vavlkvggts dvevnekkdr vtdalnatra aveegivlgg gcallrcipa 
451 ldsltpaned qkigieiikr tlkipamtia knagvegsli vekimqssse 
501 vgydamagdf vnmvekgiid ptkwrtall daagvasllt taevwteip 
551 keekdpgmga mggmgggmgg gmf 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 



5 




FEATURE TABLE: 



Key 


I Location | Qualifier | 


Peptide 


11.. 26 | label 


I Leader_peptide 




1 | note 


I "putative mitochondrial 




1 1 


I targeting sequence" 


Protein 


127.. 573 I label 


|Mat_protein 




I | note 


| "amino acid residues 266-285 




1 1 


I differ. from the translated 




1 1 


I sequence 




1 1 


I (LVLNRLKVGLQWAVKAPGF) 


Region 


1557. .5721 note 


I "keratin-like region contg. 




1 1 


IGly-Gly-Met repeats"" 


ALIGN Smith- 


-Waterman score: 81 




38 aa 


overlap starting at 


442 



csllicisl lqlmvpvntdet ie i ivenkvke 1 lanpa 



callrcipaldsltpanedq kigiei_ikrtlkipa 
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L2 ANSWER 5 OF 
ACCESSION NUMBER : 
TITLE : 



INVENTOR: 
PATENT ASSIGNEE: 

(INSP) 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 

PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE: 
OTHER SOURCE: 
DESCRIPTION: 
KEYWORD : 



9 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
95P-R67385 Protein DGENE 

DNA from Helicobacter pylori and Helicobacter felis 
used to develop prods, for detection, treatment and 
prevention of Helicobacter infection 
Ferrero R; Labigne A; Suerbaum S; Thiberge J 
(INRM)INST NAT SANTE & RECH MEDIC ALE 
INST PASTEUR 

941124 . 168 pp 

940519 
930519 
931119 

7A(i-vii) 



WO 9426901 A 
WO 94-EP1625 
EP 93-401309 
WO 93-EP3259 
Disclosure; Fig 



22 JUN 1995 (first entry) 

Patent 

English 

95-006797 [01] 

Mitochondrial protein PI 

Urease; immunogen; vaccine; diagnostic; heat shock 
protein; HSP; GroEL-like protein; Helicobacter felis 
Homo sapiens 



ORGANISM: 
ABSTRACT: 

The sequence of the Helicobacter pylori heat shock protein A (given 
in R67374) was compared to that of other GroEL-like proteins from 
Legionella pneumophila (R67381) , Escherichia coli (R67382), 
Chlamydia psittaci (R67383), Mycobacterium leprae (R67384) and 
human mitochondrial protein PI (R67385) , and regions of homology 
were identified 

17 R; 21 N; 37 D; 0 B; 3 C; 17 Q; 40 E; 



0 Z; 

H; 41 I; 46 L; 52 K; 16 M; 10 F; 16 P; 26 S; 
W; 7 Y; 57 V.- 



AMINO ACID COUNTS: 52 A; 

56 G; 1 

31 T; 1 
SEQUENCE LENGTH: 547 
SEQUENCE 

1 ymadvkfgad aralmlqgvd lladavavtm gpkgrtviie qswgspkvtk 
51 dgvtvaksid lkdkykniga klvqdvannt neeagdgttt atvlarsiak 
101 egfekiskga npveirrgvd lavdaviael kkqskpvttp eeiaqvatis 
151 angdkeigni isdamkkvgr kgvitvkdgk tlndeleiie gmkfdrgyis 
201 pyfintskgq kcefqdayvl lsekkissiq sivpaleian lvlnrlkvgl 
251 qwavkapgf lvlnrlkvgl qwavkapgf gdnrknqlkd maistggsvf 
301 geegltlnle dvqphdlgkv gevivtkdda mllkgkgdka qiekriqeii 
351 eqldvttsey ekeklnerla klsdgvavlk yggtsdvevn ekkdrvtdal 
401 natraaveeg ivlgggcall rcipaldslt panedqkigi eiikrtlkip 
451 amtiaknagv dgslivekim qsssevgyda magdfvnmve kgiidptkw 
501 rtalldaasv asllttaew vteipeekdp gmgamggmgg gmgggmf 
ALIGN Smith-Waterman score: 81 

38 aa overlap starting at 417 
csllicisllqlmvpvntdetieiivenkvkellanpa 



callrcipaldsltpanedq kigiei_ikrtlkipa 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP>. 
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REFERENCE: Derwent DGene Search Report 

FRANZ-BACON, et al., USSN: 09/099,898 
Atty. Docket No.: DX0744K 



Mouse C2b 

MKTTTCSLL IC I SLLQLMVPVNTEGTLES I VEKKVKELLANRDDC PSTVTKTFSCTS I TASGRLASC PSGMTVTGC AC 
GYGCGSWDIRDGNTCHCQCSTMDWATARCCQLA 

13 ANSWERS FOUND ABOVE A THRESHOLD OF 67 



Similarity 
Score 

76 



38 



Answer Count 



10 



20 



30 



40 



50 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 



1 



L4 ANSWER 1 OF 
ACCESSION NUMBER: 
TITLE : 



INVENTOR : 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 
PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
CROSS REFERENCES: 
DESCRIPTION: 
KEYWORD: 



13 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 

96P-W05534 Peptide DGENE 

New gClq receptor-based, HIV-1 gp 120 binding 

peptide (s) - for preventing and treating HIV-1 

infection 

Fung M S C; Kim Y W; Sun B N V; Sun C R Y; Yu L 
( TANO -N ) TANOX BIOSYSTEMS INC 
WO 9630400 Al 961003 53 pp 

WO 96-US3905 960322 
US 95-410360 950324 
Disclosure; Page 46-48 
17 JAN 1997 (first entry) 
Patent 
English 
96-455274 [45] 
N-PSDB: 96N-T41465 
gClq receptor 

gClq-R; human immunodeficiency virus 
gpl20; immunogen; vaccine; therapy; 



ORGANISM: 
ABSTRACT: 

The gClq receptor 



gClq receptor; 
type 1; HIV-1; 
diagnosis 
Homo sapiens 



(gClq-R) (W05534), a receptor for Clq complement, 
binds to HIV-1 gpl20 and neutralises the infectivity of HIV-1. The 
binding site for gpl20 has been identified (see also W05532) . The 
receptor exists on a variety of cell types, including B cells, T 
cells, monocytes, macrophages, neutrophils, eosinophils, platelets, 
fibroblasts and endothelial, liver, neural and smooth muscle cells. 
Recombinant, mature gClq-R can be produced in transformed host 
cells (see also T41465) . It is useful for detecting or quantifying 
HIV-1 gpl20, HIV-1 virions or HIV-1 infected cells, and can also be 
used to treat or prevent HIV-1 infection and to raise 
diagnostic or therapeutic value 



antibodies of 



AMINO ACID COUNTS 



:17 A; 
20 G; 
16 T; 
282 



15 

5 

3 



R; 
H; 
W; 



9 
9 
4 



N; 
I; 
Y; 



20 D; 
31 L; 
18 V; 



0 B; 
16 K; 



C; 
M; 



10 
13 



Q; 
F; 



29 
17 



E; 

P; 



0 
20 



Z; 
S; 



SEQUENCE LENGTH: 
SEQUENCE 

1 mlpllrcvpr vlgssvaglr aaapaspfrq llqpaprlct rpfgllsvra 
51 gserrpgllr prgpcacgcg cgslhtdgdk afvdflsdei keerkiqkhk 
101 tlpkmsggwe lelngteakl vrkvagekit vtfninnsip ptfdgeeeps 
151 qgqkveeqep eltstpnfw eviknddgkk alvldchype devgqedeae 
201 sdifsirevs fqstgesewk dtnytlntds ldwalydhlm dfladrgvdn 
251 tfadelvels talehqeyit fledlksfvk sq 



FEATURE TABLE: 



Key 


I Location 


I Qualifier 


1 


Peptide 


11. .73 


I label 


I Pre -propeptide 


Protein 


174. .282 


I label 


|Mat_protein 


Modif ied_site 


1114 


I label 


IGlycosylation 




1 


I note 


I "potential N-glycosylation 




1 


1 


Isite" 


Modif ied_site 


1136 


I label 


IGlycosylation 




1 


Inote 


I "potential N-glycosylation 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 



2 



Modif ied_site 1223 



Binding_site 



1239 



250 I label 
I note 



I label 
I note 



IGlycosylation 

I "potential N-glycosylation 

Isite" 

Igpl20_binding_site 
I "Claim 1" 



ALIGN Smith-Waterman score: 85 

47 aa overlap starting at 36 

pstvtktfsctsitasgrlascpsgmtvtg_cacgygcgswdirdgn 



prlctrpfgllsvra_gse_rrpgllrprgpcacgcgcgslh_tdgd 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 
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ANSWER 2 OF 13 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 



ACCESSION NUMBER: 
TITLE : 

INVENTOR: 
PATENT ASSIGNEE: 
PATENT INFO: 

APPLICATION INFO: US 88-160416 
PRIORITY INFO: US 92-983 647 

US 88-160416 
US 89-379076 
US 90-553759 
US 93-139273 

PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
CROSS REFERENCES: 
DESCRIPTION: 
KEYWORD : 



96P-R91446 Protein DGENE 

Cloning of cDNA encoding cell surface antigen - useful 
for isolation of diagnostic and therapeutic proteins 
Aruffo A; Seed B 
(GEHO)GEN HOSPITAL CORP 

US 5506126 A 960409 79 pp" 

880225 
921201 - 
880225 
890713 
900713 
931018 

Example 16; Column 85-88 
31 OCT 1996 (first entry) 
Patent 
English 
96-200279 [20] 
N-PSDB: 96N-T14726 
Human CD53 antigen 

Cell surface antigen; cloning; immunoselection; 
immunotherapy; therapy; diagnosis; vector; COS; CD53; 
lymphocyte 
Homo sapiens 



ORGANISM: 
ABSTRACT: 

Human antigen CD53 (R91446) is a type III integral membrane protein 
that may be involved in the transport of factors essential for cell 
proliferation. Its amino acid sequence was deduced from a cDNA 
clone (T14726) obtd. using a novel immunoselection cloning 
technique. CD53 was expressed in transfected COS cells. Anti-CD53 
antibodies are a useful tool for the identification of 
haematopoietic neoplasms, and may prove helpful for identifying 
morphologically poorly defined cells 
AMINO ACID COUNTS: 10 A; 3 R; 11 N; 6 D; 0 B; 12 C; 

33 L; 
14 V; 



SEQUENCE LENGTH: 
SEQUENCE 

1 



17 G; 
10 T; 
219 



5 
4 



H; 
W; 



22 I; 
7 Y; 



10 K; 6 M; 



5 Q; 
16 F; 



E; 
P; 



0 Z; 
18 S; 



mgmsslkllk yvlfffnllf wicgccilgf giyllihnnf gvlfhnlpsl 
51 tlgnvfvivg siimwaflg cmgsikenkc llmsffilll iillaevtla 
101 illfvyeqkl neyvakgltd sihryhsdns tkaawdsiqs flqccgingt 
151 sdwtsgppas cpsdrkvegc yakarlwfhs nflyigiiti cvcvievlgm 
201 sfaltlncqi dktsqtigl 



FEATURE TABLE: 
Key 



I Location | Qualifier I 



Domain 
Domain 
Domain 

Modif ied_site 
Modif ied_site 
Domain 



I 8.. 36 I label 
I 55.. 75 | label 
I 81.. 106 | label 
1149. .151 1 label 
1228. .230|label 
1 182 . .206 | label 



I Hydrophobic_domain 
I Hydrophobic_domain 
I Hydrophobic_domain 
|Glycosylation_site 
|Glycosylation_site 
I Hydrophobic_domain 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 



4 



ALIGN Smith-Waterman score : 73 

96 aa overlap starting at 75 

mktttcsll_icisllqlmvpvntegtlesivekkvkella nrddcp 



ikenkcllmsf f illliillaevtlaillfvyeqklneyvakgltdsihryhsdnstkaa 
_stvtktf sctsi t asgrlascpsgmtvtgc 

m m 9 m * V • •» * * m • * ■ « » • * • • 

wds iqs f lqccg ing t sdwt sgppas cpsdrkvegc 
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ANSWER 3 OF 13 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 



ACCESSION NUMBER: 
TITLE: 



INVENTOR : 



PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 

PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
CROSS REFERENCES: 
DESCRIPTION: 
KEYWORD : 



94P-R63790 Protein DGENE 

New xylanase enzymes from Aspergillus aculeatus - used 
for degrading plant cell wall components, e.g. in the 
prepn. of feed, in baking and in prepn. of pulp or 
paper 

Andersen L N; Christgau S; Dalboge H; He'ldt-hansen H P; 
Jacobsents; Kauppinen M S; Kofod L V; Mullertz A; Munk 
N; Si J Q 

(NOVO) NOVO -NORDISK AS 

WO 9421785 A 940929 80 pp 

WO 94-DK88 940302 

DK 93-268 930310 

DK 93-1151 931014 

Claim 10; Page 57 

07 JUN 1995 (first entry) 

Patent 

English 

94-317006 [39] 

N-PSDB: 94N-Q74637 

Aspergillus aculeatus xylanase II 

Xylanase II; Aspergillus aculeatus; 

alpha-arbino-pyranosidase; brewing; paper pulp; food 
preparation; plant cell wall degradation 
Aspergillus aculeatus 



ORGANISM: 
ABSTRACT : 

Q74637 encodes R63790 Aspergillus aculeatus xylanase II, which 
degrades plant cell wall components and reduces the viscosity of 
plant cell wall derived material. These properties are useful in 
the production of dough and baked products; in the preparation of 
feed, food, beer, wine, pulp and paper; and for the separation of 
cereal components. In addition xylanase II exhibits 



alpha-arbino-pyranosidase activity 
production of antibodies 
AMINO ACID COUNTS: 54 A; 6 R; 22 N; 

39 G; 
64 T; 
453 



and it can also be used in the 



9 H; 
11 W; 



21 D; 
38 L; 
33 V; 



0 B; 8 C; 
18 K; 4 M; 
2 Others; 



20 Q; 10 E; 
13 F; 16 P; 



17 I; 
20 Y; 

SEQUENCE LENGTH: 
SEQUENCE 

1 mvgllsitaa laatvlpniv savgldqaav akglqyfgta tdnpeltdip 
51 yvtqlnntad fgqitpgnsm kwdatepsqg tftftkgdvi adlaegngqy 
101 lrchtlvwyn qlpswvtsgt wtnatltaal knhitnwsh ykgkclhwdv 
151 vnealnddgt yrtnifytti geayipiafa aaaaadpdak lfyndynley 
201 ggakaasara ivqlvknaga kidgvglqah fsvgtvpsts slvsvlqsft 
251 algvevayte advrillptt attlaqqssd fqalvqscvq ttgcvgftiw 
301 dwtdkyswvp stfsgygaal pwdenlvkkp ayngllagmg vtvttttttt 
351 tatatgkttt tttgatstgt taahwgqcgg lnwsgptaca tgytctyvnd 
401 yysqclxsia qpkpagvlai qsvrfiyhnt qnslldlxkk ktlehtggrs 
451 smh 



0 Z; 
28 S; 



FEATURE TABLE: 

Key | Location | Qual if ier | 

Misc_dif ference | 442 Inote I "corresponding codon codes 

I I ILys" 

Misc_difference|443 . .453 Inote I "no corresponding codons" 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 
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ALIGN Smith-Waterman score: 89 
64 aa overlap starting at 342 

tvtktf sctsitasgrlascpsgmtvtg_cacgyg_cgswdirdgntchc qcstmdwa 



tvttttttttatatgkttttttgatstgttaahwgqcgglnwsgptacatgytctyvndy 
tare 

ysqc 
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FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 
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L4 ANSWER 4 OF 
ACCESSION NUMBER: 
TITLE : 



INVENTOR: 
PATENT ASSIGNEE: 

(SANO) 

(SANO) 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 
PAT. SEQ . LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
DESCRIPTION: 
KEYWORD: 



13 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
93P-R45352 protein DGENE 

New polypeptide (s) from venom of Plectreurys tristis - 

used as insecticidal agents which are paralytic and/or 

toxic to insects at low concns 

Leisy D J; Quistad G B; Skinner W S 

(SANO)SANDOZ AG 

SANDOZ PATENT GMBH 

SANDOZ-ERFINDUNGEN VERW -GES MBH 

EP 556160 A 930818 50 pp 

EP 93-810078 930208 

US 92-837194 920211 

Claim 1; Page 42 

04 FEB 1994 (first entry) 

Patent 

English 

93-260398 [33] 

Arachnoidal plectoxin polypeptide G 

Primitive Hunting Spider; insecticidal; plectoxin; 

arachnid 

Plectreurys tristis 



ORGANISM: 
ABSTRACT : 

This polypeptide corresponds to a plectoxin sequence isolated from 
the Primitive Hunting Spider (Plectreurys tristis) . The plectoxins 
have insecticidal properties and are used against Lepidopteran 
species. See also R38840 and R45351 
AMINO ACID COUNTS:3 A; 2 R; 2 N; 1 D; 0 B; 10 C; 1 Q; 1 E; 0 

0 H; 1 I; 3 L; 7 K; 0 M; 2 F; 6 P; 5 S; 
2 V; 



G; 
T; 
49 



H; 1 I; 
W; 0 Y; 

SEQUENCE LENGTH: 
SEQUENCE 

1 gckgflvkcd snseccktai vkgkkkqlsc lcgawgagcs csfrcgnrc 
ALIGN Smith-Waterman score: 76 

32 aa overlap starting at 16 
c_tsitasg_rlascpsgmtvtgcacgygcgs 



Z; 
1 



cktaivkgkkkqlsclcgawgagcscsf rcgn 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 
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ANSWER 5 OF 13 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 



ACCESSION NUMBER: 
TITLE : 



INVENTOR: 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 
PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
CROSS REFERENCES: 
DESCRIPTION: 
KEYWORD: 



92P-R20818 Protein DGENE 

New CD53 cell surface antigen and DNA encoding it - for 

inununo- therapy and diagnosis of haematopoietic 

neoplasms, etc 

Seed B; Aruffo A; Amiot M 

(GEHO-N) GEN HOSPITAL CORP 

WO 9201049 A 920123 160 pp 

WO 90-US4986 900715 - 

US 90-553759 900713 

Claim 4; Page 123 

21 MAY 1992 (first entry) 

Patent 

English 

92-056864 [07] 

N-PSDB: 92N-Q21187 

CD53 haematopoietic antigen 

Rapid immunoselection cloning technique; cell surface 
antigen; haematopoietic neoplasm; type III integral 
membrane protein 
Homo sapiens 



ORGANISM: 
ABSTRACT : 

A cDNA clone encoding CD53 was obtained using the rapid 
immunoselection cloning method (see e.g Q21164 for description of 
method) . The cDNA libraries were prepared from the promyelocytic 
tumour cell line HL60 and from peripheral blood lymphocytes and 
transfected into COS cells. The first of the four predicted 
hydrophobic regions is atypically long for either a signal sequence 
or a simple transmembrane alpha-helix. Both cysteine and glycine 
have been found to precede the signal cleavage site (Von Heijne, 
Nucleic Acid Res. 14:4683 (1986)) and the presence of 3 cysteines 
and a glycine in the middle of the first hydrophobic region 
suggests that the N-terminus of the mature CD53 protein begins 
there 

AMINO ACID COUNTS: 10 A; 



0 B; 
10 K; 



12 C; 
6 M; 



5 Q; 
16 F; 



E; 
P; 



3 R; 11 N; 6 D; 
17 G; 5 H; 22 I; 33 L; 
10 T; 4 W; 7 Y; 14 V; 
SEQUENCE LENGTH: 219 
SEQUENCE 

1 mgmsslkllk yvlfffnllf wicgccilgf giyllihnnf gvlfhnlpsl 
51 tlgnvfvivg siimwaflg cmgsikenkc llmsffilll iillaevtla 
101 illfvyeqkl neyvakgltd sihryhsdns tkaawdsiqs fl'qccgingt 
151 sdwtsgppas cpsdrkvegc yakarlwfhs nflyigiiti cvcvievlgm 
201 sfaltlncqi dktsqtigl 



0 Z; 
18 S; 



FEATURE TABLE: 
Key 



I Location | Qualifier | 



Region 
Region 
Region 
Region 

Modif ied_site 
Modif ied_site 



|8. .36 | label 
I 55.. 75 | label 
I 81.. 106 | label 
1182. .206 I label 
1149. .151 | label 
I | note 

1168. .170 I label 
I | note 



I hydrophobic 

I hydrophobic 

I hydrophobic 

I hydrophobic 

|N-iinked_glycosylation 

I "putative" 

|N-linked_glycosylation 
I "putative" 



SEARCHED ON 26 OCT 1998 
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ALIGN Smith-Waterman score: 73 

96 aa overlap starting at 75 

mktttcsll_icisllqlmvpvntegtlesivekkvkella nrddcp 



ikenkcl lms ffillliil laevt lai 1 1 f vyeqklney vakgl tds ihrynsdns tkaa 
_stvtktfsctsi t asgrlascpsgmtvtgc 



wdsiqsf lqccgingtsdwtsgppascpsdrkvegc 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 
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REFERENCE: Derwent DGene Search Report 

FRANZ-BACON, et al., USSN: 09/099,898 
Atty. Docket No.: DX0744K 



Human CIO 

MGPSSCLLLILIPLLQLINPGSTQCSLDSVMDKKIKDVLNSLEYSPSPISKKLSCASVKSQGRPSSCPAGMAVTGCAC 
GYGCGSWDVQLETTCHCQCSWDWTTARCCHLT 

25 ANSWERS FOUND ABOVE A THRESHOLD OF 67 - 

Similarity 
Score 

94 Ml 



47 



Answer Count 



10 



20 



30 



40 



50 
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L6 ANSWER 1 OF 25 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD , 
ACCESSION NUMBER: 98P-W60854 Protein DGENE 

TITLE: New bovine polypeptide that activates mammalian B 

cell(s) - used e.g. to treat T cell immunodeficiency or 
allergy, as vaccine adjuvant, as T cell surrogate for 
infants, and for monoclonal antibody production, also 
specific antibodies for treating B cell hyperactivity 

INVENTOR: Alizadeh-Khiavi K; Filipp D; Julius M H 

PATENT ASSIGNEE: (WELL-N) WELLESLEY HOSPITAL FOUND 

PATENT INFO: WO 9822580 A2 980528 64 pp 

APPLICATION INFO: WO 97-CA880 971118 

PRIORITY INFO: US 96-746883 961118 

PAT. SEQ. LOC: Claim 14; Fig 7 

DATA ENTRY DATE: 01 OCT 1998 (first entry) 

DOCUMENT TYPE: Patent 

LANGUAGE : Eng 1 i sh 

OTHER SOURCE: 98-312466 [27] 

CROSS REFERENCES: N-PSDB: 98N-V37228 

DESCRIPTION: Human CD14 protein 

KEYWORD: CD14; B cell activator; bovine lactation-associated 

imraunotropic protein; LAIT; T cell immunodeficiency; 
X-linked hyper-IgM syndrome; allergy; common variable 
immunodeficiency; X-linked agammaglobulinaemia; 
vaccine; infant feeding formulae; human 

ORGANISM: Homo sapiens 

ABSTRACT : 

This sequence is the human CD14 protein of the invention. The CD14 
protein was used to isolate the bovine CD14 of the invention, which 
is able to activate mammalian B cells. The protein is also known as 
bovine lactation-associated immunotropic protein (LAIT), and is 
used to activate B cells, particularly in humans. Particularly it 
is administered to subjects: (a) with CD40 negative or deficient B 
cells; (b) suffering from T cell immunodeficiency (e.g. X-linked 
hyper-IgM syndrome, common variable immunodeficiency or X-linked 
agammaglobulinaemia) or allergy (i.e. with CD40 ligand negative or 
defective T cells); or (c) to induce growth and differentiation of 
B cells to highly productive Ig secreting cells. Particular 
applications are in infant feeding formulae (as immunostimulant) 
and as adjuvant in vaccines (optionally with bovine CD14 coupled to 
the antigen). The DNA sequences are also used to enrich mammalian B 
cells secreting a monoclonal antibody (MAb) of particular antigenic 
specificity, by activating cells with sub-optimal amount of the DNA 
in combination with the antigen. The enriched B cells are then used 
to produce hybridomas that produce specific MAb. Antibodies raised 
against human CD14 are used to reduce/ inhibit activity of B cells 
that are hyperactivated by high serum levels of CD14 . Bovine CD14 
stimulates growth (induce DNA synthesis) in resting murine spleen 
cells and is 200 times more effective than lipopolysaccharide 
(LPS) , with the effect unaffected by presence of serum. It also 
induces Ig secretion and a partial isotype switch from IgM to IgG, 
in absence of T cells 

AMINO ACID COUNTS:43 A; 22 R; 15 N; 16 D; 0 B; 10 C; 14 Q; 20 E; 0 Z; 

24 G; 7 H; 4 I; 62 L; 9 K; 6 M; 10 F; 30 P; 29 S; 
18 T; 5 W; 3 Y; 28 V; 

SEQUENCE LENGTH: 375 
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FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 
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SEQUENCE 

1 merascllll llplvhvsat tpepceldde dfrcvcnfse pqpdwseafq 

51 cvsaveveih agglnlepfl krvdadadpr qyadtvkalr vrrltvgaaq 

101 vpaqllvgal rvlaysrlke ltledlkitg tmpplpleat glalsslrlr 

151 nvswatgrsw laelqqwlkp glkvlsiaqa hspafsyeqv rafpaltsld 

201 lsdnpglger glmaalcphk fpaiqnlalr ntgmetptgv caalaaagvq 

251 phsldlshns lratvnpsap rcmwssalns lnlsfagleq vpkglpaklr 
301 vldlscnrln rapqpdelpe vdnltldgnp flvpgtalph egsmnsgwp 
351 acarstlsvg vsgtlvllqg argfa - 



FEATURE TABLE: 

Key | Location IQualifier I 



Misc_dif ference | 87 Inote I" encoded by TGC" 

ALIGN Smith-Waterman score: 94 
58 aa overlap starting at 1 

mgpssclllilipllqlinpgstqcsldsvmdkkikdvlnsleyspspiskklscasv 



merascllll llplvhvsattpepceld dedf rcvcnf sepqpd_wseafqcvsa 
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L6 ANSWER 2 OF 25 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 98P-W41693 Protein DGENE 

TITLE: Assay for lipo : poly : saccharide binding inhibitors - 

useful in the treatment of septic shock and other 
lipo:poly : saccharide-mediated disorder(s) 

INVENTOR: Mintz D N; Tobias P; Ulevitch R 

PATENT ASSIGNEE: (SCRI) SCRIPPS RES INST 

PATENT INFO: US 5705398 A 980106 21 pp 

APPLICATION INFO: US 94-205719 940302 - 

PRIORITY INFO: 1 US 94-205719 940302 

PAT. SEQ. LOC: Claim 9; Columns 19-22 

DATA ENTRY DATE: 01 MAY 1998 (first entry) 

DOCUMENT TYPE: Patent 

LANGUAGE : Eng 1 i sh 

OTHER SOURCE: 98-086145 [08] 

CROSS REFERENCES: N-PSDB: 98N-V05505 

DESCRIPTION: Human CD14 

KEYWORD: Identification; binding inhibitor; lipopolysaccharide; 

LPS; CD14; binding protein; LBP; monocyte receptor; 

treatment; septic shock; human 
ORGANISM: Homo sapiens 

ABSTRACT: 

The present sequence was used in the development of a novel method 
for identifying a compound that inhibits the binding of a 
lipopolysaccharide (LPS) to a LPS-binding protein (LBP), or 
LBP-dependent binding of LPS to monocyte receptor CD14 . The method 
comprises measuring the fluorescence emitted by a reaction mixture 
containing f luoresceinated LPS, isolated LBP and optionally CD14 in 
the presence and absence of the compound, and identifying the 
compound as an inhibitor if the fluorescence emitted by the 
reaction mixture containing the compound is less than that emitted 
by the reaction mixture that does not contain the compound. The 
method can be used to identify drugs useful for treating septic 
shock and related LPS-mediated disorders 

AMINO ACID COUNTS:42 A; 23 R; 15 N; 16 D; 0 B; 11 C; 14 Q; 20 E; 0 Z; 

24 G; 7 H; 4 I; 62 L; 9 K; 6 M; 10 F; 30 P; 29 S; 
18 T; 5 W; 2 Y; 28 V; 

SEQUENCE LENGTH: 375 

SEQUENCE 

1 merascllll llplvhvsat tpepceldde dfrcvcnfse pqpdwseafq 
51 cvsaveveih agglnlepfl krvdadrdpr qyadtvkalr vrrltvgaaq 
101 vpaqllvgal rvlaysrlke ltledlkitg tmpplpleat glalsslrlr 
151 nvswatgrsw laelqqwlkp glkvlsiaqa hspafsceqv rafpaltsld 
201 lsdnpglger glmaalcphk fpaiqnlalr ntgmetptgv caalaaagvq 
251 phsldlshns lratvnpsap rcmwssalns lnlsfagleq vpkglpaklr 
301 vldlscnrln rapqpdelpe vdnltldgnp flvpgtalph egsmnsgwp 
351 acarstlsvg vsgtlvllqg argfa 
ALIGN Smith-Waterman score: 94 

58 aa overlap starting at 1 

mgpssclllilipllqlinpgstqcsldsvmdkkikdvlnsleyspspiskklscasv 



merascllllllplvhvsattpepceld dedf rcvcnf sepqpd_wseafqcvsa 
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FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 
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L6 



ANSWER 3 OF 25 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 



ACCESSION NUMBER: 
TITLE : 



INVENTOR: 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 



PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
CROSS REFERENCES: 
DESCRIPTION: 
KEYWORD: 



96P-W05316 Protein DGENE 

Recombinant DNA encoding myelomonocytic differentiation 
antigen CD14 - used for producing recombinant CD14 for 
use in e.g. diagnosis of myeloid disorders such as 
leukaemia 
Goyert S M 
( GOYE- I ) GOYERT 

US 5543303 A 960806 . 11 pp 

US 88-276794 
US 88-276794 
US 90-536163 
US 91-787763 
US 92-916806 
US 93-165583 
Claim 1; Fig 
03 JAN 1997 
Patent 
English 
96-370638 [37] 

N-PSDB: 96N-T39716; N-PSDB; 96N-T39717 
Myelomonocytic differentiation antigen CD14 
Myelomonocytic differentiation antigen; CD14; myeloid 
leukaemia; diagnosis 
Homo sapiens 



S M 

960806 

881128 

881128 

900608 

911106 

920722 

931213 



(first entry) 



ORGANISM : 
ABSTRACT : 

Human myelomonocytic differentiation antigen CD14 (W05316) is an 
antigen useful in the diagnosis of mature myeloid leukemia. Its 
amino acid sequence was deduced from a cDNA clone (T39717) obtd. by 
screening COS 7 cell transf ectants with monoclonal antibodies to 
CD14. Large amts. of CD14 can be produced by expression in 
transformed host cells; mature, glycosylated CD14 is produced in 
mammalian host cells, and nonglycosylated CD14 in prokaryotic hosts 



AMINO ACID COUNTS: 43 A; 

24 G; 
18 T; 
375 



22 R; 15 N; 16 D; 0 B; 11 C; 14 Q; 20 E; 
7 H; 4 I; 62 L; 9 K; 6 M; 10 F; 30 P; 
5 W; 2 Y; 28 V; 

SEQUENCE LENGTH 
SEQUENCE 

1 merascllll llplvhvsat tpepceldde dfrcvcnfse pqpdwseafq 
51 cvsaveveih agglnlepfl krvdadadpr qyadtvkalr vrrltvgaaq 
101 vpaqllvgal rvlaysrlke ltledlkitg tmpplpleat glalsslrlr 
151 nvswatgrsw laelqqwlkp glkvlsiaqa hspafsceqv rafpaltsld 
201 lsdnpglger glmaalcphk fpaiqnlalr ntgmetptgv caalaaagvq 
251 phsldlshns lratvnpsap rcmwssalns lnlsfagleq vpkglpaklr 
301 vldlscnrln rapqpdelpe vdnltldgnp flvpgtalph egsmnsgwp 
351 acarstlsvg vsgtlvllqg argfa 



0 Z; 
29 S; 



FEATURE TABLE: 

Key | Location | Qualifier I 



Peptide 
Modified site 



11.. 19 | label 
I 37 . .39 | label 
I | note 

I I 
Modif ied_site 1 151 .. 153 I label 
I | note 



I Sig_peptide 

I Gly cosy lat ion 

I "potential N-linked 

I glycosylation site" 

IGlycosylation 

I "potential N-linked 
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Modif ied_site 



266. .268 I label 
I note 



Iglycosylation site" 
IGlycosylation 



Modif ied_site 



282. .284 | label 
I note 



I "potential N-linked 
Iglycosylation site" 
IGlycosylation 



Modif ied_site 



323 . .325 I label 
I note 



I "potential N-linked 
Iglycosylation site" 
I Glycosy lat ion 



I "potential N-linked 
Iglycosylation site" 



ALIGN Smith-Waterman score: 94 

58 aa overlap starting at 1 

mgpssclllilipllqlinpgstqcsldsvmdkkikdvlnsleyspspiskklscasv 



merascllllllplvhvsattpepceld dedf rcvcnf sepqpd_wseaf qcvsa 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 



L6 ANSWER 4 OF 
ACCESSION NUMBER: 
TITLE : 



INVENTOR: 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 
PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE: 
OTHER SOURCE: 
CROSS REFERENCES: 
DESCRIPTION: 
KEYWORD: 



ORGANISM: 
ABSTRACT: 

This sequence represents the human 380-400 kD merosin major 
subunit. Merosin is an isoform of laminin and shows structural and 
sequence similarity to the human laminin A chain. Mature human 
merosin is 3 0 amino acids larger than the human laminin A chain. 
Similarly to all laminin chains, the merosin protein has distinct 
domains which are predicted to have globular regions, cysteine-rich 
rod-like regions and helical structures (see features table). 
Merosin has a large globular domain at the carboxy terminal end. 
The merosin protein has an apparent mol . wt . of about 800 kD and is 
composed of four polypeptides with molecular weights of 300, 200, 
200 and 80 kD. The 300 kD polypeptide is joined to the 200 kD 
polypeptides by disulphide bonds and the 300 and 80 kD polypeptides 
comprise the major subunit protein given in R71729. Merosin is 
found in placenta, straited muscle, peripheral nerve, trophoblasts 
and human Schwann cell neoplasms, amoung other tissues. The 
380-400 major merosin subunit also yields a 65 kD subunit. The 
380-400 merosin subunit has been designated merosin polypeptide, 
merosin subunit, M chain or laminin M chain. Fragments of the 
merosin protein may be used as antigens to raise anti-merosin 
antibodies. These antibodies may be used in the detection of 
merosin, as the absence of merosin in a tumour sample indicates 
malignancy. Contacting a neurone with merosin promotes neurite 
outgrowth. The merosin polypeptide may also be used in contacting 
inhibitors of neurite outgrowth, thereby also promoting the 
outgrowth. Merosin also promotes cell attachment. The merosin 
gene has been mapped to chromosome 6, more precisely to bands 
6q22->q23 

AMINO ACID COUNTS:205A; 159R; 162N; 183D; 0 B; 162C; 119Q; 202E; 0 Z; 

261G; 71 H; 1661; 246L; 184K; 46 M; 103F; 173P; 194S; 

193T; 29 W; 96 Y; 156V; 
SEQUENCE LENGTH: 3110 




25 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
95P-R71730 Protein DGENE 

New merosin fragments, corresp. DNA and antibodies - 
for diagnosing tumour malignancy, promoting or 
inhibiting neurite growth and promoting cell attachment 
Engvall E; Leivo I 
(LJOL-N) LA JOLLA CANCER RES FOUND 
WO 9508628 A2 950330 65 pp 

WO 94-US10730 940921 
US 93-125077 930922 
Claim 5; Fig 6 
01 MAY 1996 (first entry) 
Patent 
English 
95-139597 [18] 

N-PSDB: 95N-Q86480 AND T17419 
Merosin major subunit 

Human; 380-400 kD; merosin; major subunit; placenta; 
straited muscle; peripheral nerve; trophoblast; Schwann 
cell neoplasm; 65 kD subunit; 80 kD subunit; merosin 
polypeptide; merosin subunit; M chain; laminin M chain; 
antigen; antibody; detection; tumour; malignancy; 
neurite outgrowth; inhibitor; cell attachment 
Homo sapiens 
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SEQUENCE 

1 mpgaagvlll lllsgglggv qaqrpqqqrq sqahqqrglf pavlnlasna 
51 littnatcge kgpemycklv ehvpgqpvrn pqcricnqns snpnqrhpit 
101 naidgkntww qspsikngie yhyvtitldl qqvfqiayvi vkaansprpg 
151 nwilersldd veykpwqyha vtdtecltly niyprtgpps yakddevict 
201 sfyskihple ngeihislin grpsaddpsp elleftsary irlrfqrirt 
251 lnadlmmfah kdpreidpiv trryyysvkd isvggmcicy gharacpldp 
301 atnksrcece hntcgdscdq ccpgfhqkpw ragtfltkte ceacnchgka 
351 eecyydenva rrnlslnirg kyigggvcin ctqntaginc etctdgffrp 
401 kgvspnyprp cqpchcdpig slnevcvkde kharrglapg schcktgfgg 
451 vscdrcargy tgypdckacn csglgskned pcfgpcicke nveggdcsrc 
501 ksgffnlqed nwkgcdecfc sgvsnrcqss ywtygkiqdm sgwyltdlpg 
551 rirvapqqdd ldspqqisis naearqalph syywsapapy lgnklpavgg 
601 qltftisydl eeeeedterv lqlmiilegn dlsistaqde vylhpseeht 
651 nvlllkeesf tihgthfpvr rkefmtvlan lkrvllqity sfgmdaifrl 
701 ssvnlesavs yptdgsiaaa vevcqcppgy tgsscescwp rhrrvngtif 
751 ggicepcqcf ghaescddvt geclnckdht ggpycdkclp gfygeptkgt 
801 sedcqpcacp lnipsnnfsp tchldrslgl icdgcpvgyt gprcercaeg 
851 yfgqpsvpgg scqpcqcndn ldfsipgscd slsgsclick pgttgrycel 
901 cadgyfgdav dakncqpcrc naggsfsevc hsqtgqcecr anvqgqrcdk 
951 ckagtfglqs argcvpcncn sfgsksfdce esgqcwcqpg vtgkkcdrca 
1001 hgyfnfqegg ctacecshlg nncdpktgrc icppntigek cskcapntwg 
1051 hsittgckac ncstvgsldf qcnvntgqcn chpkfsgakc tecsrghwny 
1101 prcnlcdcfl pgtdattcds etkkcscsdq tgqctckvnv egihcdrcrp 
1151 gkfgldaknp lgcsscycfg tttqcseakg lirtwvtlka eqtilplvde 
1201 alqhtttkgi vfqhpeivah mdlmredlhl epfywklpeq fegkklmayg 
1251 gklkyaiyfe areetgfsty npqviirggt pthariivrh maapligqlt 
1301 rheiemteke wkyygddprv hrtvtredfl dilydihyil ikatygnfmr 
1351 qsriseisme vaeqgrgttm tppadliekc dcplgysgls ceaclpgfyr 
1401 lrsqpggrtp gptlgtcvpc qcnghsslcd petsicqncq hhtagdfcer 
1451 calgyygivk glpndcqqca cplisssnnf spscvaegld dyrctacprg 
1501 yegqycerca pgytgspgnp ggscqececd pygslpvpcd pvtgfctcrp 
1551 gatgrkcdgc khwharegwe cvfcgdectg lllgdlarle qmvmsinltg 
1601 plpapykmly glenmtqelk hllspqrape rliqlaegnl ntlvtemnel 
1651 ltratkvtad geqtgqdaer tntrakslge fikelardae avnekaikln 
1701 etlgtrdeaf ernleglqke idqmikelrr knletqkeia edelvaaeal 
1751 lkkvkklfge srgeneemek dlrekladyk nkvddawdll reatdkirea 
1801 nrlfavnqkn mtalekkkea vesgkrqien tlkegndild eanrladein 
1851 siidyvediq tklppmseel ndkiddlsqe ikdrklaekv sqaeshaaql 
1901 ndssavldgi ldeaknisfn ataafkaysn ikdyideaek vakeakdlah 
1951 eatklatgpr gllkedakgc lqksfrilne akklandvke nedhlnglkt 
2001 rienadarng dllrtlndtl gklsaipndt aaklqavkdk arqandtakd 
2051 vlaqitelhq nldglkknyn kladsvaktn awkdpsknk iiadadatvk 
2101 nleqeadrli dklkpikele dnlkknisei kelinqarkq ansikvsvss 
2151 ggdcirtykp eikkgsynni wnvktavad nllfylgsak fidflaiemr 
2201 kgkvsflwdv gsgvgrveyp dltiddsywy rivasrtgrn gtisvraldg 
2251 pkasivpsth hstsppgyti ldvdanamlf vggltgklkk adavrvitft 
2301 gcmgetyfdn kpiglwnfre kegdckgctv spqvedsegt iqfdgegyal 
2351 vsrpirwypn istvmfkfrt fsssallmyl atrdlrdfms veltdghikv 
2401 sydlgsgmas wsnqnhndg kwksftlsri qkqanisivd idtnqeenia 
2451 tsssgnnfgl dlkaddkiyf gglptlrnls mkarpevnlk kysgclkdie 
2501 isrtpynils spdyvgvtkg cslenvytvs fpkpgfvels pvpidvgtei 
2551 nlsfstknes giillgsggt papprrkrrq tgqayyvill nrgrlevhls 
2601 tgartmrkiv irpepnlfhd grehsvhver trgiftvqvd enrrymqnlt 
2651 veqpievkkl fvggappefq psplrnippf egciwnlvin svpmdfarpv 
2701 sfknadigrc ahqklreded gaapaeiviq pepvptpafp tptpvlthgp 
2751 caaesepall igskqfglsr nshiaiafdd tkvknrltie levrteaesg 
2801 llfymaainh adfatvqlrn glpyfsydlg sgdthtmipt kindgqwhki 
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2851 kimrskqegi lyvdgasnrt ispkkadild wgmlyvggl pinyttrrig 
2901 pvtysidgcv rnlhmaeapa dleqptssfh vgtcfanaqr gtyfdgtgfa 
2951 kavggfkvgl dllvefefat ttttgvllgi ssqkmdgmgi emideklmfh 
3001 vdngagrfta vydagvpghl cdgqwhkvta nkikhrielt vdgnqveaqs 
3051 pnpastsadt ndpvfvggfp ddlkqfgltt sipfrgcirs lkltkgtash 
3101 wrlilprpwn 



FEATURE TABLE: 



Key 




I Location 


Qualifier 




Region 




|1.. 2298 Inote |"region encoded by Q86480" 


Peptide 




11. .22 


note 


"Signal peptide" 


Domain 




123 . .286 


label 


Domain VI 






1 


note 


"predicted to form globular 






1 




structure" 


Modified. 


.site 


155. .57 


note 


"N-linked glycosylation site" 


Modif ied_ 


.site 


189. .91 


note 


"N-linked glycosylation site" 


Domain 




1287. .527 


label I 


Domain V 






1 


note 


"contains four and one half 






1 




Cystein-rich EGF-like repeats. 






1 




predicted to have rigid 






1 




rod- like structure" 


Modif ied_ 


.site 


1303 . .305 


note 


"N-linked glycosylation site" 


Modified. 


.site 


1363. .365 


note 


"N-linked glycosylation site" 


Modified. 


.site 


1380. .382 


note 


"N-linked glycosylation site" 


Modified. 


.site 


1470. .472 


note 


"N-linked glycosylation site" 


Domain 




1528. .723 


label 


Domain IVb 






1 


note 


"predicted to form globular 






1 




structure" 


Domain 




1724. .1175 


1 label 


Domain Illb 






1 


1 note 


"contains ten and one half 






1 




Cystein-rich EGF-like repeats. 






1 




predicted to have rigid 










rod- like structure" 


Modified. 


.site 


1746. .748 


I note 


"N-linked glycosylation site" 


Modified. 


.site 


11061. .1063 


I note 


"N-linked glycosylation site" 


Domain 




11176. .1379 


I label 


Domain IVa 








1 note 


"predicted to form globular 










structure" 


Domain 




11180. .1573 


I label 


Domain Ilia 








I note 


"contains four Cystein-rich 










EGF-like repeats, predicted 










to have rigid rod-like 










structure" 


Domain 




11574. .2153 


I label 


Domain I+II 








I note 


"forms two B-type chains, 










forms triple coiled-coil 










structure" 


Modified. 


.site 


11597. .1599 


1 note 


"N-linked glycosylation site" 


Modified. 


.site 


11614. .1616 


I note 


"N-linked glycosylation site" 


Modified. 


.site 


11700. .1702 


1 note 


"N-linked glycosylation site" 


Modified. 


.site 


11810. .1812 


I note 


"N-linked glycosylation site" 


Modified. 


.site 


11901. .1903 


I note 


"N-linked glycosylation site" 


Modified. 


.site 


11916. .1918 


I note 


"N-linked glycosylation site" 


Modified. 


.site 


11920. .1922 


I note 


"N-linked glycosylation site" 


Modified. 


.site 


12017. .2019 


I note 


"N-linked glycosylation site" 


SEARCHED 


ON 26 
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Modif ied_site 
Modif ied_site 
Modif ied_site 
Domain 



Modif ied_site 
Modif ied_site 
Modif ied_site 
Modif ied_site 
Modif ied_site 
Modif ied_site 
Modif ied_site 
Modif ied_site 
Modif ied_s 



12028. ,. 2030 Inote 
12045. .2047 Inote 
12126. .2128|note 
12154. ,3110 I label 
I Inote 

I I 
12240. .2242 Inote 
12360. .2362 Inote 
12435. .2437 Inote 
12478. .2480|note 
12551. .2553 Inote 
12558. .2560 Inote 
12648. .2650 Inote 
12868. .2870 Inote 



Lsite 12893 . .2895 Inote 



"N-linked glycosylation site" 
■N-linked glycosylation site" 
"N-linked glycosylation site" 
Domain G 

" forms large globule at end of 
laminin long arm" 
"N-linked glycosylation site" 
"N-linked glycosylation site" 
"N-linked glycosylation site" 
"N-linked glycosylation site" 
"N-linked glycosylation site" 
"N-linked glycosylation site" 
"N-linked glycosylation site" 
"N-linked glycosylation site" 
I "N-linked glycosylation site" 



ALIGN Smith-Waterman score: 97 

64 aa overlap starting at 939 
c_asvksqgrpsscpag mavtgca 



_c_gy gcgs wdvq 1 e 1 1 chcqc s wdwt t a 



cranvqgq_rcdkckagtfglqsargcvpcncnsfgsksfdceesgqcwcqpgvtgkkcd 
rcch 



rcah 
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L6 ANSWER 5 OF 
ACCESSION NUMBER: 
TITLE: 



INVENTOR : 
PATENT ASSIGNEE : 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 

PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
CROSS REFERENCES: 
DESCRIPTION: 
KEYWORD: 



25 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
90P-R07450 protein DGENE 

DNA encoding TNF binding protein and TNF- receptor - 
used in tumour treatment and to understand mechanismsm 
to TNF act ion 

Hauptmann R; Himmler A; Maurer-Fogy I; Stratowa C 

(BOEH) BOEHRINGER INGELHEIMINT 

EP 393438 A 901024 51 pp 

EP 90-106624 900406 

DE 89-3920282 890621 

DE 89-3913101 890421 

Disclosure; Fig 8(1-2) 

29 JAN 1991 (first entry) 

Patent 

German 

90-321987 [43] 
N-PSDB: 90N-Q06284 

Rat Tumour Necrosis Factor-Receptor from raTNF-R8 cDNA 
Tumour necrosis factor binding protein; TNF-BP; 
TNF-receptor; raTNF-R8 
Rat rattus 



ORGANISM: 
ABSTRACT: 

A rat brain cDNA analogue of the HS913T cDNA library from rat 
glioma cell line C6 (ATCC CCL107) is prepared in lambda-gtll. The 
isolated clone raTNF-R8 is used as probe to isolated the entire 
human TNF receptor, as represented in Q06285. See also 
Q06282-Q06285 

AMINO ACID COUNTS: 22 A; 26 R; 20 N; 16 D; 0 B; 31 C; 16 Q; 25 E; 0 Z; 

28 G; 14 H; 16 I; 48 L; 20 K; 9 M; 17 F; 44 P; 31 S; 

29 T; 5 W; 8 Y; 36 V; 
SEQUENCE LENGTH: 461 . 

SEQUENCE 

1 mglpivpgll lslvllallm gihpsgvtgl vpslgdrekr dnlcpqgkya 
51 hpknnsicct kchkgtylvs dcpspgqetv celshkgtft asqnhvrqcl 
101 scktcrkemf qveispckad mdtvcgckkn qfqrylseth fqcvdcspcf 
151 ngtvtipcke kqntvcncha gfflsgnect pcshckknqe cmklclppva 
201 nvtnpqdsgt avllplvifl glcllffici sllcrypqwr prvysiicrd 
251 sapvkevege givtkpltpa sipafsanpg fnptlgfstt prfshpvsst 
301 pispvfgpsn whnfvppvre wptqgadpl lygclnpvpi papvrkwedv 
351 vaaqpqrldt adpamlyaw dgvpptrwke fmrllglseh eierlelqng 
401 rclreahysm leawrrrtpr deatldwgr vlcdmnlrgc leniretles 
451 pahsstthlp r 
ALIGN Smith-Waterman score: 89 

99 aa overlap starting at 7 

pssclllilipllqlinpgstqcsldsvmdkkikdvlnsleyspspiskklscasv_ksq 



pglllslvllal lmg ihpsgvtg 1 vps lgdr ekrdn 1 cpqgkyahpknns i cc t kchkg t 
grpsscpagmavtgcacgygcgswdvqlettchc_qcsv 



y 1 vsdcpspgqe t vce 1 shk_g t f t a s qnhvr qc 1 s ckt 
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REFERENCE: Derwent DGene Search Report 

FRANZ-BACON, et al., USSN: 09/099,898 
Atty. Docket No.: DX0744K 



Mouse C18 



MKPTLCFLFILVSLFPLIVPGNAQCSFESLVDQRIKEALSRQEPKTISCTSVTSSGRLASCPAGMWTGCACGYGCGS 
WDIRNGNTCHCQCSVMDWASARCCRMA 

30 ANSWERS FOUND ABOVE A THRESHOLD OF 65 



Similarity 
Score 

80 



40 



Answer Count 



10 



20 



30 



40 



50 
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L8 ANSWER 1 OF 30 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 98P-W44299 Protein DGENE 

TITLE: Human serrate-2 gene expression products - used to 

regulate stem cell differentiation, useful in treating 
neoplasms, e.g. leukaemia 

INVENTOR: I ton A; Sakano S 

PATENT ASSIGNEE: ( ASAH) ASAHI KASEI KOGYO KK 

PATENT INFO: WO 9802458 Al 980122 103 pp 

APPLICATION INFO: WO 97-JP2414 970711 - 

PRIORITY INFO: JP 97-124063 970514 

JP 96-186220 960716 

PAT. SEQ. LOC: Claim 3; Page 62-68 

DATA ENTRY DATE: 19 JUN 1998 (first entry) 

DOCUMENT TYPE: Patent 

LANGUAGE : J apane s e 

OTHER SOURCE: 98-110528 [10] 

CROSS REFERENCES: N-PSDB: 98N-V15181 

DESCRIPTION: Human serrate 2 

KEYWORD: Human; serrate 2; regulation; stem cell; 

differentiation; neoplasm; leukaemia; endothelial cell; 
tumour 

ORGANISM: Homo sapiens 

ABSTRACT : 

The present sequence represents human serrate 2 . The present 
invention also describes a method for the preparation of the 
polypeptides, and antibodies binding to the polypeptide and its 
fragments. The polypeptide and its fragments expressed by the 
serrate-2 gene can be used to inhibit stem (especially blood stem) 
cell differentiation and to inhibit endothelial cell growth. They 
may be incorporated in a cell culture media for culturing 
undifferentiated stem cells. They can also be used for treatment of 
neoplasms such as leukaemia. The antibodies can be used for the 
diagnosis of malignant tumours 

AMINO ACID COUNTS:87 A; 76 R; 59 N; 76 D; 0 B; 130C; 31 Q; 70 E; 0 Z; 

143G; 34 H; 30 I; 72 L; 33 K; 5 M; 35 F; 88 P; 75 S; 
50 T; 26 W; 30 Y; 62 V; 

SEQUENCE LENGTH: 1212 

SEQUENCE 

1 mgyfelqlsa lrnvngells gaccdgdgrt traggcghde cdtyvrvclk 
51 eyqakvtptg pcsyghgatp vlggnsfylp pagaagdrar araraggdqd 
101 pglwipfqf awprsftliv eawdwdndtt pneellierv shagminped 
151 rwkslhfsgh vahlelqirv rcdenyysat cnkfcrprnd ffghytcdqy 
201 gnkacmdgwm gkeckeavck qgcnllhggc tvpgecrcsy gwqgrfcdec 
251 vpypgcvhgs cvepwqcnce tnwggllcdk dlnycgshhp ctnggtcina 
301 epdqyrctcp dgysgrncek aehactsnpc anggschevp sgfechcpsg 
351 wsgptcaldi decasnpcaa ggtcvdqvdg fecicpeqwv gatcqldane 
401 cegkpclnaf scknliggyy cdcipgwkgi nchinvndcr gqcqhggtck 
451 dlvngyqcvc prgfggrhce lerdkcassp chsgglcedl adgfhchcpq 
501 gfsgplcevd vdlcepspcr ngarcynleg dyycacpddf ggkncsvpre 
551 pcpggacrvi dgcgsdagpg mpgtaasgvc gphgrcvsqp ggnfscicds 
601 gftgtychen iddclgqpcr nggtcidevd afrcfcpsgw egelcdtnpn 
651 dclpdpchsr grcydlvndf ycacddgwkg ktchsrefqc daytcsnggt 
701 cydsgdtfrc acppgwkgst cavaknsscl pnpcvnggtc vgsgasfsci 
751 crdgwegrtc thntndcnpl pcynggicvd gvnwfrceca pgfagpdcri 
801 nidecqsspc aygatcvdei ngyrcscppg ragprcqevi gfgrscwsrg 
851 tpfphgsswv edcnscrcld grrdcskvwc gwkpcllagq pealsaqcpl 
901 gqrclekapg qclrppceaw gecgaeepps tpclprsghl dnncarltlh 
951 fnrdhvpqgt tvgaicsgir slpatravar drllvllcdr assgasalev 
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1001 avsfspardl pdssliqgaa haivaaitgr gnsslllavt evkvetwtg 
1051 gsstgllvpv lcgafsvlwl acwlcvwwt rkrrkerers rlpreesann 
1101 qwaplnpirn pierpgghkd vlyqcknftp pprradealp gpaghaavre 
1151 deededlgrg eedsleaekf lshkftkdpg rspgrpahwa sgpkvdnrav 
1201 rsinearyag ke 
ALIGN Smith-Waterman score: 94 
105 aa overlap starting at 253 

fplivpgn a qcsfes lvdqrikealsrqepkti_sctsvtssgrlascpag 

. : ii. • : i ■ i • i i . . • : -» • • •"! • • • .ill 
ypgcvhgscvepwqcncetnwggllcdkdlnycgshhpctnggtcinaepdqyrctcpdg 
mwtgcacg_ygcgswdirngntch cqcsvmdwasarc 

ysgrncekaehactsnpcanggschevpsgf echcp_sgwsgptc 
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L8 ANSWER 2 OF 30 DGENE COPYRIGHT 1998 DERWENT. INFORMATION LTD 
ACCESSION NUMBER: 98P-W44298 Protein DGENE 

TITLE: Human serrate-2 gene expression products - used to 

regulate stem cell differentiation, useful in treating 

neoplasms, e.g. leukaemia 
INVENTOR: Itoh A; Sakano S 

PATENT ASSIGNEE: (ASAH) AS AH I KASEI KOGYO KK 
PATENT INFO: WO 9802458 Al 980122 103 pp 

APPLICATION INFO: WO 97-JP2414 970711 . 
PRIORITY INFO: JP 97-124063 970514 

JP 96-186220 960716 
PAT. SEQ. LOC: Claim 2; Page 57-62 
DATA ENTRY DATE: 19 JUN 1998 (first entry) 
DOCUMENT TYPE: Patent 
LANGUAGE: Japanese 
OTHER SOURCE: 98-110528 [10] 

CROSS REFERENCES: N-PSDB: 98N-V15181 
DESCRIPTION: Human serrate 2 protein fragment 

KEYWORD: Human; serrate 2; regulation; stem cell; 

differentiation; neoplasm; leukaemia; endothelial cell; 

tumour 

ORGANISM: Homo sapiens 

ABSTRACT: 

The' present sequence represents a human serrate 2 protein fragment. 
The present invention also describes a method for the preparation 
of the polypeptides, and antibodies binding to the polypeptide and 
its fragments. The polypeptide and its fragments expressed by the 
serrate-2 gene can be used to inhibit stem (especially blood stem) 
cell differentiation and to inhibit endothelial cell growth. They 
may be incorporated in a cell culture media for culturing 
undifferentiated stem cells. They can also be used for treatment of 
neoplasms such as leukaemia. The antibodies can be used for the 
diagnosis of malignant tumours 

AMINO ACID COUNTS:72 A; 58 R; 52 N; 68 D; 0 B; 126C; 29 Q; 54 E; 0 Z; 

132G; 30 H; 27 I; 59 L; 24 K; 5 M; 31 F; 73 P; 67 S; 
47 T; 21 W; 28 Y; 52 V; 

SEQUENCE LENGTH: 1055 

SEQUENCE 

1 mgyfelqlsa lrnvngells gaccdgdgrt traggcghde cdtyvrvclk 
51 eyqakvtptg pcsyghgatp vlggnsfylp pagaagdrar araraggdqd 
101 pglwipfqf awprsftliv eawdwdndtt pneellierv shagminped 
151 rwkslhfsgh vahlelqirv rcdenyysat cnkfcrprnd ffghytcdqy 
201 gnkacmdgwm gkeckeavck qgcnllhggc tvpgecrcsy gwqgrfcdec 
251 vpypgcvhgs cvepwqcnce tnwggllcdk dlnycgshhp ctnggtcina 
301 epdqyrctcp dgysgrncek aehactsnpc anggschevp sgfechcpsg 
351 wsgptcaldi decasnpcaa ggtcvdqvdg fecicpeqwv gatcqldane 
401 cegkpclnaf scknliggyy cdcipgwkgi nchinvndcr gqcqhggtck 
451 dlvngyqcvc prgfggrhce lerdkcassp chsgglcedl adgfhchcpq 
501 gfsgplcevd vdlcepspcr ngarcynleg dyycacpddf ggkncsvpre 
551 pcpggacrvi dgcgsdagpg mpgtaasgvc gphgrcvsqp ggnfscicds 
601 gftgtychen iddclgqpcr nggtcidevd afrcfcpsgw egelcdtnpn 
651 dclpdpchsr grcydlvndf ycacddgwkg ktchsrefqc daytcsnggt 
701 cydsgdtfrc acppgwkgst cavaknsscl pnpcvnggtc vgsgasfsci 
751 crdgwegrtc thntndcnpl pcynggicvd gvnwfrceca pgfagpdcri 
801 nidecqsspc aygatcvdei ngyrcscppg ragprcqevi gfgrscwsrg 
851 tpfphgsswv edcnscrcld grrdcskvwc gwkpcllagq pealsaqcpl 
901 gqrclekapg qclrppceaw gecgaeepps tpclprsghl dnncarltlh 
951 fnrdhvpqgt tvgaicsgir slpatravar drllvllcdr assgasalev 
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1001 avsfspardl pdssliqgaa haivaaitqr gnsslllavt evkvetwtg 
1051 gsstg 
ALIGN Smith-Waterman score: 94 

105 aa overlap starting at 253 

fplivpan a qcsfes lvdqrikealsrqepkti_sctsvtssgrlascpag 

• I I Z • • ! 1 ■ I • I • > • •••• 

ypgcvhgscvepwqcncetnwggllcdkdlnycgshhpctnggtcinaepdqyrctcpdg 
mwtgcacg_ygcgswdirngntch ^cqcsvmdwasarc 

• as •••••• * • • - ■•■ a 

ysgrncekaehactsnpcanggschevpsgf echcp_sgwsgptc 
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L8 ANSWER 3 OF 30 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 96P-W05834 Protein DGENE 

TITLE: Vertebrate Serrate protein and related DNA - used to 

treat or prevent malignancies characterised by 
increased Notch activity 

INVENTOR: Artavanis-Tsakonas S; Gray G E; Henrique D M P; 

Ish-Horowicz D; Lewis J H; Mann R S; Myat A M 

PATENT ASSIGNEE: ( IMCR) IMPERIAL CANCER RES TECHNOLOGY 
(UYYA) UNIV YALE 

PATENT INFO: WO 9627610 Al 960912 161 pp 

APPLICATION INFO: WO 96-US3172 960307 

PRIORITY INFO: US 95-400159 950307 

PAT. SEQ. LOC: Claim 5; Page 104-107 

DATA ENTRY DATE: 28 JAN 1997 (first entry) 

DOCUMENT TYPE: Patent 

LANGUAGE : Eng 1 i sh 

OTHER SOURCE: 96-425379 [42] 

CROSS REFERENCES: N-PSDB: 96N-W05834 

DESCRIPTION: Human Serrate-2 (HJ2) 

KEYWORD: Serrate-2; human jagged-2; HJ2; Notch; cell 

differentiation; cell fate; central nervous system; 
cancer; tissue repair; therapy; diagnosis; antibody 

ORGANISM: Homo sapiens 

ABSTRACT: 

Human Serrate-1 (W05833) and human Serrate-2 (W05833) are ligands 
for the zygotic neurogenic locus Notch, and are believed to play a 
major role in determining cell fates (differentiation) in the 
central nervous system. Their amino acid sequences were deduced 
from cDNA clones (see also T40090-91) isolated from human foetal 
brain cDNA libraries. The proteins, antibodies raised to them, and 
encoding nucleic acids can be used in the detection of Serrate 
sequences and in the treatment of disorders of cell fate or 
differentiation, partic. cancer, nervous system disorders and in 
tissue repair or regeneration 

AMINO ACID COUNTS:85 A; 87 R; 58 N; 64 D; 0 B; 137C; 34 Q; 56 E; 0 Z; 

149G; 34 H; 32 I; 72 L; 36 K; 10 M; 39 F; 90 P; 86 S; 
59 T; 30 W; 28 Y; 71 V; 

SEQUENCE LENGTH: 1257 

SEQUENCE 

1 minpedrwks lhfsghvahl elqirvrcde nyysatcnkf crprndffgh 
51 ytcdqygnka cmdgwmgkec keavckqgcn llhggctvpg ecrcsygwqg 
101 rfcdecvpyp gcvhgscvep wqcncetnwg gllcdkdlny cgshhpctng 
151 gtcinaepdq yrctcpdgys grncekaeha ctsnpcangg schevpsgfe 
201 chcpsgwsgp tcaldideca snpcaaggtc vdqvdgfeci cpeqwvgatc 
251 qldanecegk pclnafsckn liggyycdci pgwkginchi nvndcrgqcq 
301 hggtckdlvn gyqcvcprgf ggrhcelerd kcasspchsg glcedladgf 
351 hchcpqgfsg plcevdvdlc epspcrngar cynlegdyyc acpddfggkn 
401 csvprepcpg gacrvidgcg sdagpgmpgt aasgvcgphg rcvsqpggnf 
451 scicdsgftg tycheniddc lgqpcrnggt cidevdafrc fcpsgwegel 
501 cdtnpndclp dpchsrgrcy dlvndfycac ddgwkgktch srefqcdayt 
551 csnggtcyds gdtfrcacpp gwkgstcava knssclpnpc vnggtcvgsg 
601 asfscicrdg wegrtcthnt ndcnplpcyn ggicvdgvnw frcecapgfa 
651 gpdcrinide cqsspcayga tcvdeingyr cscppgragp rcqevigfgr 
701 scwsrgtpfp hgsswvedcn scrcldgrrd cskvwcgwkp cllagqpeal 
751 saqcplgqrc lekapgqclr ppceawgecg aeeppstpcl prsghldnnc 
801 arltlhfnrd hvpqgttvga icsgirslpa travardrll vllcdrassg 
851 asavevavsf spardlpdss liqgaahaiv aaitqrgnss lllavtevkv 
901 etwtggsst gllvpvlcga fsvlwlacw lcvwwtrkrr kerersrlpr 
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951 eesannqwap lnpirnpier pgghkdvlyq cknftppprr rcpgrpatrp 

1001 sgrmrrtril aavrrtpwrr rssshtnspk ilaarrggrp tgpqapkwtt 

1051 arsgasmrpa tsarevgrlq lgrdpgpsvg ampsagpggr ghvhsffilc 

1101 kkttknknqm fifyvsltly klfsncqaen ngvfsdscyf ckvavrgtrc 

1151 mkgeskgclr rhqivafvtr gcalftessf ysslgflcap gqsagethgc 

1201 vgvahgcwwd pwlmvwpvav ggtrgcqwdl wlsvgptwg gtlvidvala 
1251 agtargc 



FEATURE TABLE: 

Key | Location I Qualifier I 













Domain 


11. .912 


I label 


|Extracellular_domain 




1 
1 
1 
1 
1 




I note 


I "a deletion in the encoding 
IcDNA clone results in loss of 
Ipart of the Serrate-2 signal 
I peptide and beginning of the 
|DSL domain 


Domain 


126. . 

1 

1 

1 

1 


70 


I label 
I note 


IDSL 

I "region of homology with 
IDrosophila Delta and Serrate, 
I predicted to mediate binding 
Iwith Notch" 


Domain 


175. . 

1 

1 


735 


I label 
I note 


IELR 

I "epidermal growth factor-like 
I repeat domain" 


Region 


175. . 


105 


I label 


IELR1 


Region 


1106. 


.140 


I label 


IELR2 


Region 


1141. 


.180 


I label 


IELR3 


Region 


1181. 


.218 


I label 


I ELR4 


Region 


1219. 


.256 


I label 


IELR5 


Region 


1257. 


.294 


I label 


IELR6 


Region 


1295. 


.331 


1 label 


IELR7 


Region 


1332. 


.369 


I label 


I ELR8 


Region 


1370. 


.407 


I label 


IELR9 


Region 


1408. 


.435 


I label 


I Partial_ELR 


Region 


1436. 


.469 


1 label 


I Partial_ELR 


Region 


1470. 


.507 


I label 


IELR10 


Region 


1508. 


.545 


I label 


IELR11 


Region 


1546. 


.584 


I label 


IELR12 


Region 


1585. 


.622 


I label 


IELR13 


Region 


1623. 


.660 


I label 


IELR14 


Region 


1664. 


.701 


I label 


IELR15 


Region 


1702. 


.718 


I label 


I Partial_ELR 


Region 


1719. 


.735 


I label 


I Partial_ELR 


Domain 


1913. 


.933 


1 label 


I Transmembrane_domain 


Domain 


1934. 


.1257 


I label 


I Intracellular_domain 



ALIGN Smith-Waterman score: 94 
105 aa overlap starting at 109 

fplivpgn_a qcsf es lvdqrikealsrqepkti_sctsvtssgrlascpag 

*»• i * • • * • • 

• ■ • • • • •••»• ••■ • • >•■• 

ypgcvhgscvepwqcncetnwggllcdkdlnycgshhpctnggtcinaepdqyrctcpdg 
mwtgcacg_ygcgswdirngntch cqcsvmdwasarc 



ysgrncekaehactsnpcanggschevpsgfechcp_sgwsgptc 
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L8 ANSWER 4 OF 30 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 98P-W54234 peptide DGENE 

TITLE: Detection and therapy of cervical cancer - using 

specific cervical cancer-associated proteins as targets 
for treatment or as indicators for detection 

INVENTOR: Keesee S K; Obar R; Wu Y 

PATENT ASSIGNEE: (MATR-N)MATRITECH INC 

PATENT INFO: WO 9809170 A2 980305 79 pp 

APPLICATION INFO: WO 97-US14526 970819 

PRIORITY INFO: US 96-705660 960830 

PAT. SEQ. LOC: Claim 12; Page 55-56 

DATA ENTRY DATE: 10 AUG 1998 (first entry) 

DOCUMENT TYPE: Patent 

LANGUAGE : Eng 1 i sh 

OTHER SOURCE: 98-230271 [20] 

DESCRIPTION: . Human TDP-43 protein 

KEYWORD: Cervical cancer-associated protein; CvC; tryptic 

peptide; human; detection; treatment; TDP-43; TAR DNA 
binding protein; non-chromatin protein 

ORGANISM: Homo sapiens 

ABSTRACT : 

This protein is the human TDP-43 protein (also known as TAR DNA 
binding protein) which is used to obtain tryptic peptides which are 
used in a method for detecting cervical, cancer. The method involves 
detecting the presence of a cervical cancer-associated protein 
(CvC) in a tissue or body fluid sample. The CvC is characterised as 
having a molecular weight of 44900-69400 Daltons as determined by 
sodium dodecyl-sulphate (SDS)-PAGE techniques and an isoelectric 
point (pi) of 5.1-6.6 as determined by standard isoelectric 
focusing techniques. The protein is further characterised as being 
a non-chromatin protein which is detectable at a higher level in a 
human cervical cancer cell than in a normal human cervical cell, as 
determined by 2D-gel electrophoresis. The methods can be used for 
the early and rapid detection of cervical cancer, for treating 
cervical cancers and for monitoring the efficacy of such treatment 

AMINO ACID COUNTS:26 A; 20 R; 28 N; 22 D; 0 B; 6 C; 24 Q; 22 E; 0 Z; 

55 G; 5 H; 14 I; 21 L; 20 K; 18 M; 22 F; 16 P; 41 S; 
15 T; 6 W; 8 Y; 25 V; 

SEQUENCE LENGTH: 414 

SEQUENCE 

1 mseyirvted endepieips eddgtvllst vtaqfpgacg lryrnpvsqc 
51 mrgvrlvegi lhapdagwgn lvywnypkd nkrkmdetda ssavkvkrav 
101 qktsdlivlg lpwktteqdl keyfstfgev lmvqvkkdlk tghskgfgfv 
151 rfteyetqvk vmsqrhmidg rwcdcklpns kqsqdeplrs rkvfvgrcte 
201 dmtedelref fsqygdvmdv fipkpfrafa fvtfaddqia qslcgedlii 
251 kgisvhisna epkhnsnrql ersgrfggnp ggfgnqggfg nsrgggaglg 
301 nnqgsnmggg mnfgafsinp ammaaaqaal qsswgmmgml asqqnqsgps 
351 gnnqnqgnmq repnqafgsg nnsysgsnsg aaigwgsasn agsgsgfngg 
401 fgssmdskss gwgm 
ALIGN Smith-Waterman score: 80 

74 aa overlap starting at 224 

kptlcf If ilvslfplivpgnaqcsf eslvdqrikealsrqepktisctsvtssgrlasc 



kpfrafaf v tf addqiaqslca edliikaisvhisnaepkhnsnralersarfaan 
pagmwtgcacgyg 

Pggf gnqggfg 
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INVENTOR: 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 
PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
DESCRIPTION: 
KEYWORD : 



L8 ANSWER 5 OF 30 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 94P-R62046 protein DGENE 

TITLE: Treating insulin-like growth factor related disease - 

by admin, of opt. modified binding protein, also new 
non-phosphorylated binding protein, e.g. for treating 
cancer or restenosis 
Cox G N; Russell DA 
( SYND) SYNERGEN INC 

WO 9422466 A 941013 68 pp 

WO 94-US3755 940406 
US 93-45265 930407 
Disclosure; Page 10 
09 MAY 1995 (first entry) 
Patent 
English 
94-332814 [41] 
Human mature IGFBP-1 

IGFBP-1; insulin-like growth factor binding protein-1; 
IGF-binding protein; BP-1; breast cancer; colon cancer; 
lung cancer; ovary cancer; liver cancer; osteosarcoma; 
glioma; rhabdomyosarcoma; restenosis; acromegaly; 
obesity; diabetic nephropathy; retinopathy 
ORGANISM: Homo sapiens 

ABSTRACT: 

Muteins of human IGFBP-1 (mature protein given in R62046, signal 
peptide in R62047) have been produced containing a S98C or S101C 
substitution. Such muteins can be attached via thiol-reactive 
groups to PEG to form novel therapeutic agents 
AMINO ACID COUNTS: 26 A; 10 R; 9 N; 7 D; 0 B; 18 C; 9 Q; 22 E; 0 Z; 

17 G; 6 H; 9 I; 17 L; 9 K; 3 M; 5 F; 17 P; 21 S; 9 
T; 5 W; 6 Y; 9 V; 
SEQUENCE LENGTH: 234 
SEQUENCE 

1 apwqcapcsa eklalcppvs ascsevtrsa gcgccpmcal plgaacgvat 
51 arcarglscr alpgeqqplh altrgqgacv qesdasapha aeagspespe 
101 steiteeell dnfhlmapse edhsilwdai stydgskalh vtnikkwkep 
151 crielyrwe slakaqetsg eeiskfylpn cnkngfyhsr qcetsmdgea 
201 glcwcvyfwn gkripgspei rgdpncqiyf nvqn 
ALIGN Smith-Waterman score: 87 

37 aa overlap starting at 11 
ealsrqepktisctsvtssgrlascpagmwtgcacg 



eklalcppvsascsevtrsagcgccpmcalplgaacg 
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* 



) 



Mouse C19 

MKNLSFPLLFLFFLVPELLGSSMPLCPIDEAIDKKIKQDFNSLFPNAIKNIGLNCWTVSSRGKLASCPEGTAVLSCSC 
GSACGSWDIREEKVCHCQCARIDWTAARCCKLQVAS 

29 ANSWERS FOUND ABOVE A THRESHOLD OF 68 

Similarity 
Score 



91 



46 



Answer Count 



10 



20 



30 



40 
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L10 ANSWER 1 OF 29 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER : 95P-R74680 protein DGENE 

TITLE: Genetically engineered tissue plasminogen activator - 

is modified at positions 44-50 and 296-302 and is 
non-glycosylated in Kl and K2 regions, has extended 
half -life and PAI-1 resistance 

INVENTOR: Huang C; Huang P; Liu S 

PATENT ASSIGNEE: (BIOE-N) BIOENGINEERING INST ACAD MILITARY 

PATENT INFO: CN 1082111 A 940216 . 7 pp 

APPLICATION INFO: CN 93-109234 930806 

PRIORITY INFO: CN 93-109234 930806 

PAT. SEQ. LOC: Claim 2; Fig 1 and Page 5 

DATA ENTRY DATE: 04 JAN 1996 (first entry) 

DOCUMENT TYPE: Patent 

LANGUAGE: Chinese 

OTHER SOURCE: 95-162457 [22] 

DESCRIPTION: t-PA mutein (N117Q, N184Q, delta 296-302, 44-50 

replaced by KPIAEK) 
KEYWORD: Tissue plasminogen activator; tPA; thrombolytic agent; 

mutein; deglycosylated kringle domain; PAI-1 resistance 
ORGANISM: Synthetic 
ABSTRACT: 

The sequences given in R74678-R74689 are examples of preferred 
mutant versions of human tPA. In all the muteins, amino acids 296- 
3 02 of wild-type tPA (involved in interaction with PAI-1) have been 
deleted and the kringle domains have been deglycosylated by substn. 
of Asn 117 in Kl and Asnl84 in K2 by Asp residues. Also, amino 
acids 44-50 of wild-type tPA are replaced by a motif which differs 
between different muteins. The modified tPA proteins have prolonged 
half-life, are resistant to PAI-1 and have affinity for fibrin; 
they are useful as thrombolytic agents 

AMINO ACID COUNTS:33 A; 33 R; 20 N; 28 D; 0 B; 35 C; 28 Q; 27 E; 0 Z; 

42 G; 14 H; 20 I; 39 L; 21 K; 5 M; 16 F; 28 P; 45 S; 
25 T; 13 W; 24 Y; 23 V; 

SEQUENCE LENGTH: 519 

SEQUENCE 

1 syqvicrdek tqmiyqqhqs wlrpvlrsnr veycwcnsgr aqckpiaekc 
51 seprcfnggt cqqalyfsdf vcqcpegfag kcceidtrat cyedqgisyr 
101 gtwstaesga ectnwqssal aqkpysgrrp dairlglgnh nycrnpdrds 
151 kpwcyvfkag kyssefcstp acsegnsdcy fgqgsayrgt hsltesgasc 
201 lpwnsmilig kvytaqnpsa qalglgkhny crnpdgdakp wchvlknrrl 
251 tweycdvpsc stcglrqysq pqfrikgglf adiashpwqa aifaerflcg 
301 gilisscwil saahcfqerf pphhltvilg rtyrwpgee eqkfevekyi 
351 vhkefdddty dndiallqlk sdssrcaqes swrtvclpp adlqlpdwte 
401 celsgygkhe alspfyserl keahvrlyps srctsqhlln rtvtdnmlca 
451 gdtrsggpqa nlhdacqgds ggplvclndg rmtlvgiisw glgcgqkdvp 
501 gvytkvtnyl dwirdnmrp 



FEATURE TABLE: 

Key I Location I Qualifier I 



Domain 1 1 . . 49 I label I f inger_domain 

| |note | "amino acids 44-50 of F domain 

| | | have been replaced by the 

| | | sequence KPIAEK" 

Domain I50..86 | label |E_domain 
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Domain 



Domain 



Domain 



275. .519 



87. .175 



176. .274 



note 

label 

note 



label 
note 



label 
note 



"growth factor domain" 
Kringle_l 

"substn. of Asnll7 (corresp. 
to position 116 in this 
mutein) by Asp destroys an 
N-linked glycosylation site" 
Kringle_2 

"substn. of Asnl84 (corresp. 
to position 183 in this 
mutein) by Asp destroys an 
N-linked glycosylation site" 
P_domain 

"amino acids 296-302 of native 
tPA have been deleted; these 
residues are involved in 
interaction with PAI-1" 



Disulfide. 


_bond 


i 6 • • 


36 


Disulfide. 


_bond 


134. 




43 


Disulfide. 


_bond 


150. 




61 


Disulfide. 


_bond 


155. 




72 


Disulfide. 


_bond 


174. 




83 


Disulfide. 


_bond 


191. 




172 


Disulfide. 


_bond 


1112 




.154 


Disulfide. 


_bond 


1 143 




.167 


Disulfide. 


_bond 


1 179 




.260 


Disulfide. 


_bond 


1200 




.242 


Disulfide. 


.bond 


1231 




.255 


Disulfide. 


_bond 


1263 




.387 


Disulfide. 


_bond 


1299 




.315 


Disulfide. 


_bond 


1307 




.376 


Disulfide. 


_bond 


1401 




.476 


Disulfide 


_bond 


I 433 




.449 



Disulf ide_bond | 466.. 494 | I 

ALIGN Smith-Waterman score: 91 

84 aa overlap starting at 8 

deaidkkikqdfnslfpnaikniglncwtvssrgklascpegtavlscscgsacgswdir 

dektqmiyqqhqswlrpvlrsnrveycwcnsgraqckpiaekcseprcfnggtc_qqaly 
eekvchcqcaridwtaarccklqv 



f sdfv_cqcp egf agkcceidt 
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29 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
97P-W19616 Protein DGENE 

New active mutants of radish antifungal protein 2 - 
used to generate fungus-resistant plants or as 
therapeutic or preservative agents 
Broekaert W F; De Samblanx G W; Rees S B 
( ZENE ) ZENECA LTD 

WO 9721814 Al 970619 39 pp 

WO 96-GB3065 961212 - 
GB 95-25474 951213 
Claim 1; Fig 1 
13 DEC 1997 (first entry) 
Patent 
English 
97-332785 [30] 

Radish antifungal protein 2 (Rs-AFP2) 

Rs-AFP2; radish antifungal protein 2; fungicide; salt 
tolerance; preservative; transgenic plant; crop 
protection 
ORGANISM: Raphanus sativus 

ABSTRACT: 

This polypeptide comprises radish antifungal protein 2 (Rs-AFP2) . 
Novel potent antifungal proteins (see W26371-90) based on Rs-AFP2 
contain at least 1 mutation selected from a basic residue at 
positions 9 or 39, and a hydrophobic residue at positions 5 or 16. 
Proteins containing Gln5Met (see W26379), Glyl6Met (W26380), 
Gly9Arg (W26376) , Val39Arg (W26377) or Gly9Arg plus Val39Arg 
(W26378) mutations are specifically claimed. A cDNA clone encoding 
Rs-AFP2 preprotein can be modified by recombinant DNA methods to 
allow expression of mutant isoforms in yeast as mating factor alpha 
1 fusion proteins. The Rs-AFP2 mutants have enhanced salt tolerant 
antifungal activity, especially when expressed in plant tissue 



L10 ANSWER 2 OF 
ACCESSION NUMBER: 
TITLE : 



INVENTOR: 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 
PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE: 
OTHER SOURCE: 
DESCRIPTION: 
KEYWORD: 



where that may 
are useful for 
pharmaceutical 
AMINO ACID COUNTS: 3 

G; 

T; 

SEQUENCE LENGTH: 51 
SEQUENCE 

1 qklcqrpsgt 
51 c 



have curative as well as protective effects. They 
combating fungal diseases in agricultural, 
or preservative applications 

A; 3 R; 5 N; 0 D; 0 B; 8 C; 3 Q; 1 E; 0 
2 H; 2 I; 2 L; 4 K; 0 M; 2 F; 3 P; 3 S; 
1 W; 2 Y; 2 V; 



wsgvcgnnna cknqcirlek arhgscnyvf pahkcicyfp 



Z; 
1 



FEATURE TABLE: 



Key 




I Location I Qualifier I 


Misc. 


.difference 


15 


I note 

1 

1 


I "Gin at position 5 may be 
I replaced by a hydrophobic 
I amino acid, preferably Met"" 


Misc. 


.difference 


19 


I note 

1 

1 


I "Gly at position 9 may be 
I replaced by a basic amino 
lacid, preferably Arg" " 


Misc. 


.difference 


1 16 


I note 

1 

1 


I "Gly at position 16 may be 
I replaced by a hydrophobic 
I amino acid, preferably Met"" 


Misc. 


.difference 


139 


I note 


I "Val at position 39 may be 
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I 
I 



I 
I 



I replaced by a basic amino 
lacid, preferably Arg"" 



ALIGN Smith-Waterman score: 88 
47 aa overlap starting at 2 

klascpegtavlscscgsacgswdireekvchcqcaridwtaarc_c 



klcqrpsgtwsgvcgnnnacknqcirlekarhgscnyv_fpahkcic 
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L10 ANSWER 3 OF 29 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 97P-W19281 Protein DGENE ' 

TITLE: Antifungal peptide derived from radish antifungal 

protein 2 - and related DNA, useful for producing 
plants with increased fungal resistance and as 
therapeutic or preservative agent 

INVENTOR: Borremans F A M; Broekaert W F; De Samblanx; Fant F; 

Meloen R H; Puijk W C; Rees S B; Schaaper W M M; 
Sijtsma L; Van Amerongen A; Van Gelder W M J 

PATENT ASSIGNEE: ( ZENE ) ZENEC A LTD 

PATENT INFO: WO 9721815 A2 970619 65 pp 

APPLICATION INFO: WO 96-GB3068 961212 
PRIORITY INFO: GB 96-6552 960328 

GB 95-25455 951213 
PAT. SEQ. LOC: Disclosure; Figure 1 
DATA ENTRY DATE: 21 JAN 1998 (first entry) 
DOCUMENT TYPE: Patent 
LANGUAGE: ' English 

OTHER SOURCE: 97-332786 [30] 

DESCRIPTION: Raphanus sativus antifungal protein 2 (Rs-AFP2) 

KEYWORD: Antifungal protein; Candida; fungal resistance; food 

additive; radish; crop protection; plant defensin; 

bacterial protection; preservative 
ORGANISM: Raphanus sativus 

ABSTRACT: 

This protein sequence is the Rhapanus sativus (radish) mature 
antifungal protein (Rs-AFP2) , which is homologous to proteins 
W19280- W19290. Shorter peptides, based on these proteins have been 
produced (see W19291-92, W19294-98, W19301-304, W19330-34 and 
W31765-834) . Plants containing DNA sequences encoding these 
proteins have improved resistance to fungi. Compositions containing 
the peptides can be used to control fungi or bacteria in 
pharmaceutical (e.g. treatment of Candida infections) or 
preservative purposes (as food additives) . In agriculture, the 
peptide may be used to improve disease resistance or disease 
tolerance of crops, either pre or post harvest. When applied to 
plants they may also have curative as well as protective actions. 
The peptides may also be used to protect plants by introducing 
them, or a microorganism capable of expressing the peptide into the 
soil 

AMINO ACID COUNTS: 3 A; 3 R; 5 N; 0 D; 0 B; 8 C; 3 Q; 1 E; 0 Z; 4 

G; 2 H; 2 I; 2 L; 4 K; 0 M; 2 F; 3 P; 3 S; 1 
T; 1 W; 2 Y; 2 V; 

SEQUENCE LENGTH: 51 

SEQUENCE 

1 qklcqrpsgt wsgvcgnnna cknqcirlek arhgscnyvf pahkcicyfp 
51 c 

ALIGN Smith-Waterman score: 88 

47 aa overlap starting at 2 

klascpegtavlscscgsacgswdireekvchcqcaridwtaarc_c 

• • • * * •••••• 

• • ■ ■ •••• • • • • • • • • 

klcqrpsgtwsgvcgnnnacknqcirlekarhgscnyv_fpahkcic 
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L10 ANSWER 4 OF 29 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 97P-W26380 Protein DGENE 

TITLE: New active mutants of radish antifungal protein 2 - 

used to generate fungus-resistant plants or as 
therapeutic or preservative agents 

INVENTOR: Broekaert W F; De Samblanx G W; Rees S B 

PATENT ASSIGNEE: ( ZENE ) ZENECA LTD 

PATENT INFO: WO 9721814 Al 970619 39 pp 

APPLICATION INFO: WO 96-GB3065 961212 

PRIORITY INFO: GB 95-25474 951213 ~ 

PAT. SEQ. LOC: Claim 2; Page 4 

DATA ENTRY DATE: 13 DEC 1997 (first entry) 

DOCUMENT TYPE: Patent 

LANGUAGE : Eng 1 i sh 

OTHER SOURCE: 97-332785 [30] 

DESCRIPTION: Radish antifungal protein 2 mutant (G16M) 

KEYWORD: Rs-AFP2; radish antifungal protein 2; fungicide; salt 

tolerance; preservative; transgenic plant; crop 

protection 

ORGANISM: Chimeric Raphanus sativus; Chimeric synthetic 

ABSTRACT: 

This polypeptide comprises a specifically claimed Glyl6Met mutant 
of radish antifungal protein 2 (Rs-AFP2) (see also W19616) that 
shows enhanced salt tolerant antifungal activity compared to the 
wild- type protein, especially when expressed in plants. A cDNA 
clone encoding Rs-AFP2 preprotein can be modified by recombinant 
DNA methods to allow expression of the mutant isoform in yeast as a 
mating factor alpha 1 fusion protein. Rs-AFP2 mutants (see also 
W26371-79 and W26381-90) are useful for combating fungal diseases 
in agricultural, pharmaceutical and preservative applications. When 
applied to plants, they may have curative as well as protective 
effects 

AMINO ACID COUNTS: 3 A; 3 R; 5 N; 0 D; 0 B; 8 C; 3 Q; 1 E; 0 Z; 3 

G; 2 H; 2 I; 2 L; 4 K; 1 M; 2 F; 3 P; 3 S; 1 
T; 1 W; 2 Y; 2 V; 

sequence length: 51 
Sequence 

1 qklcqrpsgt wsgvcmnnna cknqcirlek arhgscnyvf pahkcicyfp 
51 c 

ALIGN Smith-Waterman score: 86 
47 aa overlap starting at 2 

klascpegtavlscscgsacgswdireekvchcqcaridwtaarc_c 



klcqrpsgtwsgvcmnnnacknqcirlekarhgscnyv_fpahkcic 
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39 pp 



L10 ANSWER 5 OF 29 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 

97P-W26379 Protein DGENE 

New active mutants of radish antifungal protein 2 
used to generate fungus-resistant plants or as 
therapeutic or preservative agents 
Broekaert W F; De Samblanx G W; Rees SB 
(ZENE) ZENECA LTD 
WO 9721814 Al 970619 
WO 96-GB3065 961212 
GB 95-25474 951213 
Claim 2; Page 4 
13 DEC 1997 (first entry) 
Patent 
English 
97-332785 [30] 

Radish antifungal protein 2 mutant 
Rs-AFP2; radish antifungal protein 
tolerance; preservative; transgenic plant; crop 
protection 

Chimeric Raphanus sativus; Chimeric synthetic 



ACCESSION NUMBER: 
TITLE : 



INVENTOR : 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 
PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE: 
OTHER SOURCE: 
DESCRIPTION: 
KEYWORD : 



(Q5M) 

2; fungicide; salt 



ORGANISM: 
ABSTRACT: 

This polypeptide comprises a specifically claimed Gln5Met mutant of 
radish antifungal protein 2 (Rs-AFP2) (see also W19616) that shows 
enhanced salt tolerant antifungal activity compared to the wild- 
type protein, especially when expressed in plants. A cDNA clone 
encoding Rs-AFP2 preprotein can be modified by recombinant DNA 
methods to allow expression of the mutant isoform in yeast as a 
mating factor alpha 1 fusion protein. Rs-AFP2 mutants (see also 
W26371-78 and W26380-90) are useful for combating fungal diseases 
in agricultural, pharmaceutical and preservative applications. When 
applied to plants, they may have curative as well as protective 
effects 

AMINO ACID COUNTS: 3 A; 3 R; 5 N; 0 D; 0 B; 8 C; 2 Q; 1 E; 0 Z; 4 

G; 2 H; 2 I; 2 L; 4 K; 1 M; 2 F; 3 P; 3 S; 1 
T; 1 W; 2 Y; 2 V; 

SEQUENCE LENGTH: 51 

SEQUENCE 

1 qklcmrpsgt wsgvcgnnna cknqcirlek arhgscnyvf pahkcicyfp 
51 c 

ALIGN Smith-Waterman score: 86 

47 aa overlap starting at 2 

klascpegtavlscscgsacgswdireekvchcqcaridwtaarc_c 



klcmrpsgtwsgvcgnnnacknqcirlekarhgscnyv_fpahkcic 



\ 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 
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REFERENCE: Derwent DGene Search Report 

FRANZ-BACON, et al.. USSN: 09/099,898 
Atty. Docket No.: DX0744K 



Rat C19 



MKNLSFLLLFLFFLVLGLLGPSMSLCPMDEAISKKINQDFSSLLPAAMKNTVLHCWSVSSRGRLASCPEGTTVTSCSC 
GSGCGSWDVREDTMCHCQCGS I DWTAARCCTLRVGS 

333 ANSWERS FOUND ABOVE A THRESHOLD OF 68 

Similarity 
Score 

135 | 



68 



Answer Count 70 140 210 280 350 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 



1 



LI 2 ANSWER 1 OF 333 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 87P-P70641 protein DGENE 

TITLE: New modified tissue plasminogen activator - with new 

encoding DNA, new DNA expression vector, useful for 
treating vascular disorders, eg, pulmonary embolism 
arterial thrombosis 

INVENTOR: Bang N U; Little S P; Schoner B E; Weigel B J 

PATENT ASSIGNEE: (ELIL)ELI LILLY & CO 

PATENT INFO: AU 8661804 A 870305 164 pp 

APPLICATION INFO: AU 86-61804 860825 

PRIORITY INFO: US 85-769298 850826 

PAT. SEQ. LOC: Claim 9; page 122 

DATA ENTRY DATE: 10 APR 1991 (first entry) 

DOCUMENT TYPE: Patent 

LANGUAGE : Eng 1 i sh 

OTHER SOURCE: 87-108842 [16] 

CROSS REFERENCES: N-PSDB: 87N-N70990 

DESCRIPTION: Modified tissue plasminogen activator 

KEYWORD: Tissue plasminogen activator; kringle domain; embolism; 

thrombosis; stroke; 
ORGANISM: Homo sapiens 

ABSTRACT: 

The modified t-PA has. all/part of the kringle domains of native 
t-PA removed. The t-PA has functional properties superior to those 
of native t-PA. It retains fibrin binding properties and interacts 
more slowly and inefficiently with plasminogen activator 
inhibitor (s) compared to native t-PA. It is obtd. in large amts . 
from a prokaryotic host. Modified t-PA used for treating vascular 
disorders, eg deep vein thrombosis, pulmonary embolism, peripheral 
arterial thrombosis, disseminated intravascular coagulation, emboli 
from the heart or peripheral arteries, acute myocardial infarction, 
thrombotic strokes or fibrin deposits associated with invasive 
cancers. t-PA is used at a dosage of 250000 to 5000000 units at a 
loading dose or in a deep vein thrombosis -pulmonary embolism, or 
250000-7500000 units over 30-90 mins . in acute myocardial 
infarction 

AMINO ACID COUNTS: 18 A; 24 R; 9 N; 19 D; 0 B; 23 C; 22 Q; 19 E; 0 Z; 

28 G; 12 H; 15 I; 30 L; 13 K; 5 M; 13 F; 19 P; 29 S; 

16 T; 7 W; 13 Y; 20 V; 
SEQUENCE LENGTH: 354 
SEQUENCE 

1 mgsyqvicrd ektqmiyqqh qswlrpvlrs nrveycwcts graqchsvpv 
51 kscseprcfn ggtcqqalyf sdfvcqcpeg fagkcceist cglrqysqpq 
101 frikgglfad iashpwqaai fakhrrspge rflcggilis scwilsaahc 
151 fqerfpphhl tvilgrtyrv vpgeeeqkfe vekyivhkef dddtydndia 
201 llqlksdssr caqesslvrt vclppadlql pdwtecelsg ygkhealspf 
251 yserlkeahv rlypssrcts qhllnrtvtd nmlcagdtrs ggpqanlhda 
301 cqgdsggplv clndgrmtlv giiswglgcg qkdvpgvytk vtnyldwird 
351 nmrp 

ALIGN Smith-Waterman score: 135 
93 aa overlap starting at 1 

lgpsmslcpmdeaiskkinqdfssllpaamkntvlhcwsvssrgrlascp_egttvtscs 



mgsyqvic_rdektqmiyqqhqswlrpvlrsnrveycwctsgraqchsvpvkscseprcf 
cgsgcgswdvredtmchcqcgsidwtaarcctl 



nggtcqqalyf sdfvcqcpeg f agkccei 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 
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L12 ANSWER 2 OF 
ACCESSION NUMBER: 
TITLE : 



INVENTOR: 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 
PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE: 
OTHER SOURCE: 
DESCRIPTION: 

KEYWORD: 



333 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
95P-R74682 protein DGENE 

Genetically engineered tissue plasminogen activator - 
is modified at positions 44-50 and 296-302 and is 
non-glycosylated in Kl and K2 regions, has extended 
half-life and PAI-1 resistance 
Huang C; Huang P; Liu S 

(BIOE-N) BIOENGINEERING INST ACAD MILITARY 



940216 
930806 
930806 
1 and Page 5 
(first entry) 



CN 1082111 A 940216 7 pp 

CN 93-109234 
CN 93-109234 
Claim 2; Fig 
04 JAN 1996 
Patent 
Chinese 
95-162457 [22] 
t-PA mutein (N117Q, N184Q, 
replaced by ERHTSVQT) 

Tissue plasminogen activator; tPA; thrombolytic agent; 
mutein; deglycosylated kringle domain; PAI-1 resistance 
Synthetic 



delta 296-302, 44-50 



ORGANISM: 
ABSTRACT: 

The sequences given in R74678-R74689 are examples of preferred 
mutant versions of human tPA. In all the muteins, amino acids 296- 
302 of wild-type tPA (involved in interaction with PAI-1) have been 
deleted and the kringle domains have been deglycosylated by substn. 
of Asn 117 in Kl and Asnl84 in K2 by Asp residues. Also, amino 
acids 44-50 of wild-type tPA are replaced by a motif which differs 
between different muteins. The modified tPA proteins have prolonged 
half-life, are resistant to PAI-1 and have affinity for fibrin; 
they are useful as thrombolytic agents 
AMINO ACID COUNTS:32 A; 34 R; 20 N; 28 D; 0 B; 35 C; 29 Q; 27 E; 

39 L; 
24 V; 



42 G; 
27 T; 
521 



15 
13 



H; 
W; 



19 K; 5 M; 16 F; 27 P; 



19 I; 
24 Y; 

SEQUENCE LENGTH: 
SEQUENCE 

1 syqvicrdek tqmiyqqhqs wlrpvlrsnr veycwcnsgr aqcerhtsvq 
51 tcseprcfng gtcqqalyfs dfvcqcpegf agkcceidtr atcyedqgis 
101 yrgtwstaes gaectnwqss alaqkpysgr rpdairlglg nhnycrnpdr 
151 dskpwcyvfk agkyssefcs tpacsegnsd cyfgqgsayr gthsltesga 
201 sclpwnsmil igkvytaqnp saqalglgkh nycrnpdgda kpwchvlknr 
251 rltweycdvp scstcglrqy sqpqfrikgg lfadiashpw qaaifaerfl 
301 cggilisscw ilsaahcfqe rfpphhltvi lgrtyrwpg eeeqkfevek 
351 yivhkefddd tydndiallq lksdssrcaq esswrtvcl ppadlqlpdw 
401 tecelsgygk healspfyse rlkeahvrly pssrctsqhl lnrtvtdnml 
451 cagdtrsggp qanlhdacqg dsggplvcln dgrmtlvgii swglgcgqkd 
501 vpgvytkvtn yldwirdnmr p 



0 Z; 
46 S; 



FEATURE TABLE: 
Key 



I Locat ion I Qual i fieri 



Domain 



Domain 



11. .51 



I 



I label 
I note 
I 
I 

I 52.. 88 | label 



I f inger_domain " - 

I "amino acids 44-50 of F domain 
I have been replaced by the 
I sequence ERHTSVQT" 
IE domain 



SEARCHED ON 26 OCT 1998 
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3 



Domain 



Domain 



Domain 



Disul 
Disul 
Disul 
Disul 
Disul 
Disul 
Disul 
Disul 
Disul 
Disul 
Disul 
Disul 
Disul 
Disul 
Disul 
Disul 
Di 



fide, 
fide, 
fide, 
fide, 
fide, 
fide, 
fide, 
fide, 
fide, 
fide, 
fide 
fide, 
fide, 
fide, 
fide, 
fide, 
sulf i 



bond 
bond 
bond 
bond 
bond 
bond 
bond 
bond 
bond 
bond 
bond 
bond 
bond 
bond 
bond 
bond 



89. .177 



178. .276 



277. .521 



6. .36 
34. .43 



52. , 
57. . 
76. , 
93 . , 
114, 
145, 
181, 
202. 
233 , 



63 

74 

85 

174 

.156 

.169 

.262 

.244 

.257 



265. .389 
301. .317 
309. .378 
403. .478 
435. .451 



note 

label 

note 



label 
note 



label 
note 



"growth factor domain" 
Kringle_l 

"substn. of Asnll7 (corresp. 
to position 118 in this 
mutein) by Asp destroys an 
N-linked glycosylation site" 
Kringle_2 

"substn. of Asnl84 (corresp. 
to position 185 in this 
mutein) by Asp destroys an 
N-linked glycosylation site" 
P_domain 

"amino acids 296-302 of native 
tPA have been deleted; these 
residues are involved in 
interaction with PAI-1" 



de_bond | 4 68.. 49 6 I 



I 



ALIGN Smith-Waterman score: 134 
87 aa overlap starting at 8 

deaiskkinqdf ssllpaamkntvlhcwsvssrgrlascpegttvtscs. 



_c gsgcg 



dektqmiyqqhqswlrpvlrsnrveycw_cns_gr_aqcerhtsvqtcseprcfnggtcq 
swdvredtmchcqcgsidwtaarcctl 



qalyf sdfvcqcpeg. 



_f agkccei 
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L12 ANSWER 3 OF 333 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 



ACCESSION NUMBER: 
TITLE : 



INVENTOR: 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: US 95-389615 
PRIORITY INFO: 



98P-W54147 protein DGENE 

Mutant tissue plasminogen activator proteins - useful 
for treating vascular disorders, preventing tissue 
adhesion(s), etc 

Goeddel D V; Leung D W H; Rice G C 
(GETH ) GENENTECH INC 



US 5736135 A 



980407 
950213 
910711 
930126 
940401 
950213 



24 pp 



PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE: 
OTHER SOURCE: 
DESCRIPTION: 
KEYWORD: 



US 91-728456 
US 93-8940 
US 94-221660 
US 95-389615 
Claim 6; Page - 
20 JUL 1998 (first entry) 
Patent 
English 
98-239153 [21] 

t-PA mutant (Y93C, T103A, N184S, G198D) 
Amino acid substitution; t-PA; vascular disorder; 
prevention; fibrin deposition; adhesion formation 
Synthetic 



0 B; 36 C; 26 Q; 
21 K; 5 M; 16 F; 
0 Others ; 



26 E; 
29 P; 



0 

49 



Z; 
S; 



ORGANISM: 
ABSTRACT : 

Mutant tissue plasminogen activator proteins (W54147-W54158) are 
created by single or multiple amino acid substitutions. 
Compositions containing the t-PA variant are used for treating 
vascular disorders, for preventing fibrin deposition or for 
preventing adhesion formation or reformation. Note: This sequence- 
is not given in the specification but was created from the wild 
type by the indexer 

AMINO ACID COUNTS: 33 A; 35 R; 21 N; 29 D; 

42 G; 16 H; 19 I; 39 L; 
24 T; 13 W; 34 Y; 14 V; 

SEQUENCE- LENGTH : 527 

SEQUENCE 

1 syqvicrdek tqmiyqqhqs wlrpvlrsnr veycwcnsgr aqchsvpyks 
51 cseprcfngg tcqqalyfsd fvcqcpegfa gkcceidtra tccedqgisy 
101 rgawstaesg aectnwnssa laqkpysgrr pdairlglgn hnycrnpdrd 
151 skpwcyvfka gkyssefcst pacsegnsdc yfgsgsayrg thsltesdas 
201 clpwnsmili gkvytaqnps aqalglgkhn ycrnpdgdak pwchylknrr 
251 ltweycdyps cstcglrqys qpqfrikggl fadiashpwq aaifakhrrs 
301 pgerflcggi lisscwilsa ahcfqerfpp hhltyilgrt yrwpgeeeq 
351 kfeyekyiyh kefdddtydn diallqlksd ssrcaqessv vrtvclppad 
401 lqlpdwtece lsgygkheal spfyserlke ahvrlypssr ctsqhllnrt 
451 ytdnmlcagd trsggpqanl hdacqgdsgg plyclndgrm tlygiiswgl 
501 gcgqkdypgy ytkvtnyldw irdnmrp 



FEATURE TABLE: 

Key | Location I Qualifier I 



Domain 


11. . 


44 Inote 


I "Finger domain" 


Domain 


145. 


.91 Inote 


I "Growth factor domain" 


Domain 


192. 


.173 Inote 


I " Kr ingle -1 domain" 


Domain 


1180 


. .261 Inote 


I "Kringle-2 domain" 


Domain 


1264 


. . 527 Inote 


I "Serine protease domain" 


Misc_dif f erence 


193 


I note 


I "Y changed from wt to T in 
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Misc_dif ference | 103 
I 

Misc_dif ference | 184 
I 

Misc_dif ference | 198 
I 



| | mutant " 

I note I "T changed from wt to A in 

| | mutant " 

I note I "N changed from wt to S in 

| | mutant" 

I note I "G changed from wt to D in 

I | mutant" 



ALIGN Smith-Waterman score: 129 
87 aa overlap starting at 8 

deaiskkinqdf ssllpaamkntvlhcwsvssrgrlascp_egttvtscscgsgcgswdv 

! ! ■ • Z I " • ■ • a • • a • • a a a 

dektqmiyqqhqswlrpvlrsnrveycwcnsgraqchsvpykscseprcfnggtcqqaly 
redtmchc qcgs idwtaarcc 



f sdfvcqcpegfagkcceid_tratcc 
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L12 ANSWER 4 OF 333 
ACCESSION NUMBER 
TITLE : 



INVENTOR: 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 
PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
DESCRIPTION: 

KEYWORD: 



DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
95P-R74688 protein DGENE 

Genetically engineered tissue plasminogen activator 
is modified at positions 44-50 and 296-302 and is 
non-glycosylated in Kl and K2 regions, has extended 
half-life and PAI-1 resistance 
Huang C; Huang P; Liu S 

(BIOE-N)BIOENGINEERING INST ACAD MILITARY 



CN 1082111 A 
CN 93-109234 
CN 93-109234 
Claim 2; Fig 1 



940216 
930806 
930806 
and Page 



7 pp 



delta 296-302, 44-50 



04 JAN 1996 (first entry) 
Patent 
Chinese 
95-162457 [22] 
t-PA mutein (N117Q, N184Q, 
replaced by DNCRRPG) 

Tissue plasminogen activator; tPA; thrombolytic agent; 
mutein; deglycosylated kringle domain; PAI-1 resistance 
ORGANISM: Synthetic 
ABSTRACT: 

The sequences given in R74678-R74689 are examples of preferred 
mutant versions of human tPA. In all the muteins, amino acids 296- 
302 of wild-type tPA (involved in interaction with PAI-1) have been 
deleted and the kringle domains have been deglycosylated by substn. 
of Asn 117 in Kl and Asnl84 in K2 by Asp residues. Also, amino 
acids 44-50 of wild-type tPA are replaced by a motif which differs 
between different muteins. The modified tPA proteins have prolonged 
half-life, are resistant to PAI-1 and have affinity for fibrin; 
they are useful as thrombolytic agents 
AMINO ACID COUNTS:32 A; 35 R; 21 N; 29 D; 0 B; 36 C; 28 Q; 26 E; 

39 L; 
23 V; 



43 G; 
25 T; 
520 



14 
13 



H; 
W; 



19 K; 5 M; 16 F; 28 P; 



19 I; 
24 Y; 

SEQUENCE LENGTH: 
SEQUENCE 

1 syqvicrdek tqmiyqqhqs wlrpvlrsnr veycwcnsgr aqcdncrrpg 
51 cseprcfngg tcqqalyfsd fvcqcpegfa gkcceidtra tcyedqgisy 
101 rgtwstaesg aectnwqssa laqkpysgrr pdairlglgn hnycrnpdrd 
151 skpwcyvfka gkyssefcst pacsegnsdc yfgqgsayrg thsltesgas 
201 clpwnsmili gkvytaqnps aqalglgkhn ycrnpdgdak pwchvlknrr 
251 ltweycdvps cstcglrqys qpqfrikggl fadiashpwq aaifaerflc 
301 ggilisscwi lsaahcfqer fpphhltvil grtyrwpge eeqkfeveky 
351 ivhkefdddt ydndiallql ksdssrcaqe sswrtvclp padlqlpdwt 
401 ecelsgygkh ealspfyser lkeahvrlyp ssrctsqhll nrtvtdnmlc 
451 agdtrsggpq anlhdacqgd sggplvclnd grmtlvgiis wglgcgqkdv 
501 pgvytkvtny ldwirdnmrp 



0 

45 



Z; 
S; 



FEATURE TABLE: 
Key 



I Location I Qualifier | 



Domain 



Domain 



II. .50 
I 



I label 
I note 
I 



I 51.. 87 | label 



I f inger_domain 

I "amino acids 44-50 of F domain 
I have been replaced by the 
I sequence DNCRRPG" 
I E_domain 
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\ 



Domain 



Domain 



Domain 



Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulf ide_bond 
Disulfide bond 



88. .176 



177. .275 



276. .520 



6. .3 
34. . 
51. . 
56. . 
75. . 
92. . 
113. 
144. 
180. 
201. 
232. 
264. 
300. 
308. 
402. 
434. 



Disulfide bond 14 



6 

43 
62 
73 
84 
173 
.155 
.168 
.261 
.243 
.256 
.388 
.316 
.377 
.477 
.450 
67 



note 

label 

note. 



label 
note 



label 
note 



495| 



"growth factor domain" 
Kringle_l 

"substn. of Asnll7 by Asp 
destroys an N- linked 
glycosylation site" 
Kringle_2 

"substn. of Asnl84 by Asp 
destroys an N-linked 
glycosylation site" 
P_domain 

"amino acids 296-302 of native 
tPA have been deleted; these 
residues are involved in 
interaction with PAI-1" 



I 



ALIGN Smith-Waterman score: 129 
83 aa overlap starting at 8 

deaiskkinqdf ssllpaamkntvlhcwsvssrgrlascpe_gttvtscscgsgcgswdv 

• • •••• ••*••• • • • • • 

• • • . • •••• •••••• ••••• • •••• 

dektqmiyqqhqswlrpvlrsnrveycwcnsgraqcdncrrpgcseprcfnggtcqqaly 
redtmchcqcgs idwtaarcct 1 



f sdfvcqcpeg. 



_f agkccei 
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L12 ANSWER 5 OF 333 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 95P-R74678 protein DGENE 

TITLE: Genetically engineered tissue plasminogen activator - 

is modified at positions 44-50 and 296-302 and is 
non-glycosylated in Kl and K2 regions, has extended 
half-life and PAI-1 resistance 
Huang C; Huang P; Liu S 

(BIOE-N) BIOENGINEERING INST ACAD MILITARY 



INVENTOR : 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO 
PRIORITY INFO: 
PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
DESCRIPTION: 

KEYWORD: 



CN 1082111 A 940216 7 pp 

CN 93-109234 930806 * 
CN 93-109234 930806 
Claim 2; Fig 1 and Page 5 
04 JAN 1996 (first entry) 
Patent 
Chinese 
95-162457 [22] 
t-PA mutein (N117Q, N184Q, 
replaced by ESKPEAEE) 

Tissue plasminogen activator; tPA; thrombolytic agent; 
mutein; deglycosylated kringle domain; PAI-1 resistance 
Synthetic 



delta 296-302, 44-50 



ORGANISM: 
ABSTRACT: 

The sequences given in R74678-R74689 are examples of preferred 
mutant versions of human tPA. In all the muteins, amino acids 296- 
302 of wild-type tPA (involved in interaction with PAI-1) have been 
deleted and the kringle domains have been deglycosylated by substn. 
of Asn 117 in Kl and Asnl84 in K2 by Asp residues. Also, amino 
acids 44-50 of wild-type tPA are replaced by a motif which differs 
between different muteins. The modified tPA proteins have prolonged 
half-life, are resistant to PAI-1 and have affinity for fibrin; 
they are useful as thrombolytic agents 
AMINO ACID COUNTS:33 A; 33 R; 20 N; 28 D; 0 B; 35 C; 28 Q; 30 E; 0 Z; 

39 L; 20 K; 5 M; 16 F; 28 P; 46 S; 
23 V; 



42 G; 
25 T; 
521 



14 H; 
13 W; 



19 I; 
24 Y; 

SEQUENCE LENGTH: 
SEQUENCE 

1 syqvicrdek tqmiyqqhqs wlrpvlrsnr veycwcnsgr aqceskpeae 
51 ecseprcfng gtcqqalyfs dfvcqcpegf agkcceidtr atcyedqgis 
101 yrgtwstaes gaectnwqss alaqkpysgr rpdairlglg nhnycrnpdr 
151 dskpwcyvfk agkyssefcs tpacsegnsd cyfgqgsayr gthsltesga 
201 sclpwnsmil igkvytaqnp saqalglgkh nycrnpdgda kpwchvlknr 
251 rltweycdvp scstcglrqy sqpqfrikgg lfadiashpw qaaifaerfl 
301 cggilisscw ilsaahcfqe rfpphhltvi lgrtyrwpg eeeqkfevek 
351 yivhkefddd tydndiallq lksdssrcaq esswrtvcl ppadlqlpdw 
401 tecelsgygk healspfyse rlkeahvrly pssrctsqhl lnrtvtdnml 
451 cagdtrsggp qanlhdacqg dsggplvcln dgrmtlvgii swglgcgqkd 
501 vpgvytkvtn yldwirdnmr p 



FEATURE TABLE: 
Key 



I Location | Qualifier | 



Domain 



Domain 



II. .51 
I 



I 



I label 
I note 
I 
I 

I 52.. 88 | label 



I f inger_domain 

I "amino acids 44-50 of F domain 
I have been replaced by the 
I sequence ESKPEAEE" 
IE domain 
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\ 



Domain 



Domain 



Domain 



Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide. 
Disulfide 



.bond 
.bond 
.bond 
.bond 
.bond 
.bond 
.bond 
.bond 
.bond 
.bond 
.bond 
.bond 
bond 
.bond 
.bond 
bond 
bond 



89. .177 



178. .276 



277. .521 



6. . 
34, 
52. 



36 

.43 

.63 



57. .74 
76. .85 
93. .174 
114. .156 
145. '.169 
181. .262 
202. .244 
233. .257 
265. .389 
301. .317 
309. .378 
403 . .478 
435. .451 
468. .496 



note 

label 

note 



label 
note 



label 
note 



"growth factor domain" 
Kringle_l 

"substn. of Asnll7 (corresp. 
to position 118 in this 
mutein) by Asp destroys an 
N-linked glycosylation site" 
Kringle_2 

"substn. of Asnl84 (corresp. 
to position 185 in this 
mutein) by Asp destroys an 
N-linked glycosylation site" 
P_domain 

"amino acids 296-302 of native 
tPA have been deleted; these 
residues are involved in 
interaction with PAI-1" 



ALIGN Smith-Waterman score: 129 
84 aa overlap starting at 8 

deaiskkinqdf ssllpaamkntvlhcwsvssrgrlascpegttvt scscgsgcgswd 



dektqmiyqqhqswlrpvlrsnrveycwcnsgraqceskpeaeecseprcfnggtcqqal 
vredtmchcqcgsidwtaarcctl 



yf sdfvcqcpeg f agkccei 



SEARCHED ON 26 OCT 1998 

FILE LAST UPDATED: 18 OCT 1998 <19981018/UP> 



10 




Human C23 

^1KALCLLLLPVLGLLVSSKTLCSMEEAINERIQEVAGSLIFRAISSIGLECQSVTSRGDLATCPRGFAVTGCTCGSAC 
GSWDVRAETTCHCQCAGMDWTGARCCRVQP 

20 ANSWERS FOUND ABOVE A THRESHOLD OF 66 

Similarity 



Score 



100 



50 



Answer Count 



10 



20 



30 



40 



50 
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LI 4 ANSWER 1 OF 20 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 97P-W24566 peptide DGENE 

TITLE: Serine protease from Streptomyces griseus ATCC 55178 - 

with good stability in presence of urea or guanidine, 
useful in cleaning compositions, including laundry and 
dishwashing detergents 

INVENTOR : Leigh S D 

PATENT ASSIGNEE: (CLRX)CLOROX CO 

PATENT INFO: US 5646028 A 970708 16 pp 

APPLICATION INFO: US 91-718303 910618 
PRIORITY INFO: US 91-718303 910618 

US 92-973343 921106 

US 94-292924 940818 

US 95-544143 951017 
PAT. SEQ. LOC: Claim 3; Column 25 
DATA ENTRY DATE: 05 NOV 1997 (first entry) 
DOCUMENT TYPE: Patent 
LANGUAGE : English 
OTHER SOURCE: 97-362936 [33] 

DESCRIPTION: Serine protease C-terminal sequence 

KEYWORD: Serine protease; C-terminus; Streptomyces griseus; 

guanidine; pre-soak; cleaning composition; laundry 
detergent; additive composition; enzyme; dishwasher 
detergent; drain opener; urea; contact lens cleanser; 
proteinaceous stain 

ORGANISM: Streptomyces griseus variety alkaliphilus No. 33 

ABSTRACT : 

This sequence represents the C-terminal sequence of the serine 
protease of the invention. The serine protease was isolated from 
Streptomyces griseus variety alkaliphilus No. 33 (ATCC 55178) . The 
protease has an apparent molecular weight of 19 kD (by reducing 
sodium dodecylsulphate polyacrylamide gel electrophoresis), and 
improved stability against urea and guanidine. The serine protease 
is specific for the substrate represented by W24567, but also 
recognises the substrates shown in W26078-W26096 . The protease is 
inhibited by phenylmethylsulphonyl fluoride. The serine protease is 
useful in liquid or granular cleaning compositions, specifically 
laundry detergents or additive compositions. It is also useful in 
automatic dishwasher detergents, pre-soaks, drain openers, contact 
lens cleansers etc. The protease has better activity against 
proteinaceous stains than known enzymes and unusually high 
stability in the presence of chaotropic agents 

AMINO ACID COUNTS:4 A; 4 R; 3 N; 1 D; 0 B; 4 C; 9 Q; 0 E; 0 Z; 

18 G; 2 H; 6 I; 3 L; 0 K; 0 M; 2 F; 4 P; 15 S; 
17 T; 1 W; 2 Y; 7 V; 

SEQUENCE LENGTH: 102 

SEQUENCE 

1 vtgstqatvg ssicrsgstt gwrcgtiqqh ntsvtypqgt itgvtrtsac 
51 aqpgdsggsf isgtqaqgvt sggsgncsig gttfhqpvnp ilsqygltlv 
101 rs 

ALIGN Smith-Waterman score: 101 
75 aa overlap starting at 6 

qevagslifraissiglecqsvtsrgdlatcprgfavtgctcgsacgs_wdvraettchc 

qatvgssicrsgsttgwrcgtiqqhntsvtypqg_titgvtrtsacaqpgdsggsf isgt 
qcagmdwtgarccrv 



qaqgvtsggsgncsi 
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L14 ANSWER 2 OF 20 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 

ACCESSION NUMBER: 95P-R77256 Protein DGENE 

TITLE: Pure, truncated fungal cellulase protein from 

Trichoderma - useful to reduce or eliminate dye, 
colourant or pigment back-staining or redeposition in 
stone-washing or bio-polishing 

INVENTOR: Clarkson K A; Collier K D; Fowler T; Larenas E; Ward M 

PATENT ASSIGNEE: (GEMV) GENENCOR INT INC 

PATENT INFO: WO 9516782 A 950622 105 pp 

APPLICATION INFO: WO 94-US14163 941219 

PRIORITY INFO: US 93-169948 931217 

PAT. SEQ. LOC: Claim 12; Page 38 

DATA ENTRY DATE: 13 DEC 1995 (first entry) 

DOCUMENT TYPE: Patent 

LANGUAGE : Eng 1 i sh 

OTHER SOURCE: 95-231574 [30] 

CROSS REFERENCES: N-PSDB: 95N-Q91276 

DESCRIPTION: Truncated endoglucanase EG I catalytic core 

KEYWORD: Cellulase; catalytic core; enzyme 

ORGANISM: Trichoderma longibrachiatum 

ABSTRACT: 

A truncated fungal cellulase of Trichoderma comprising a EG I 
catalytic core with the sequence in R77256, which is encoded by 
Q91276, is claimed. The truncated cellulase is capable of 
endoglucanase activity. Genes for EG I and EGII have been isolated 
from T. longibrachiatum and the protein domain structure has been 
confirmed (Penttila, M. et al . 1986, Gene 45, 253-263; Van Arsdell, 
J.N. et al. 1987, Bio/Technology 5, 60-64; Saloheimo, M. et al., 
1988, Gene 63, 11-21) 

AMINO ACID COUNTS:2 A; 0 R; 2 N; 1 D; 0 B; 6 C; 4 Q; 0 E; 0 Z; 5 

G; 0 H; 0 I; 1 L; 0 K; 0 M; 0 F; 2 P; 7 S; 2 
T; 2 W; 3 Y; 2 V; 

SEQUENCE LENGTH: 39 

SEQUENCE 

1 qacssvwgqc ggqnwsgptc casgstcvys ndyysqclp 
ALIGN Smith-Waterman score: 79 

13 aa overlap starting at 9 
qcagmdwtgarcc 



qcggqnwsgpt cc 
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20 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
96P-R98208 Protein DGENE 

Cell-targetted retroviral vector particles - having 
envelope protein modified with targetting polypeptide 
Anderson W; Chiang Y L; Januszeski M; Mackrell A J.- 
Zhao Y 

(GENE-N) GENETIC THERAPY INC 
UNIV SOUTHERN CALIFORNIA 

WO 9630504 Al 961003 73 pp 

WO 96-US3908 960322 
US 95-409648 950324 
Example 2; Page 36 
30 DEC 1996 (first entry) 
Patent 
English 
96-455352 [45] 

Nucleotide used in production of MSH/MoMuLV chimeric 
sequence 

Moloney murine leukaemia virus; gp70; 4070A retrovirus; 
retrovirus; 10A1 murine leukaemia virus; NZB-9-1 murine 
leukaemia virus; poly tropic MX27 provirus; targetted 
drug delivery; gene therapy; single chain antibody; 
envelope protein; ss 
ORGANISM: Synthetic 
ABSTRACT: 

Cell targetted retroviral vector particles can be used in gene 
therapy to deliver a heterologous gene to a target cell for 
expression of a heterologous polypeptide in that cell. The cell 
targetted retroviral vector particles comprise an envelope protein 
which is modified to contain a targetting polypeptide (a single 
chain antibody) , or in the case of moloney murine leukaemia virus 
(MoMuLV) , alpha melanotropin-stimulating hormone (MSH) . Two 
oligonucleotides (R98207, R98208) were used to substitute sequences 
in MoMuLV for MSH sequences. This oligonucleotide was used to 
replace residues G80-P88 of MoMuLV envelope protein (See W04248) 
AMINO ACID COUNTS: 8 A; 0 R; 0 N; 0 D; 0 B; 17 C; 0 Q; 0 E; 0 Z; 8 

0 H; 0 I; 0 L; 0 K; 0 M; 0 F; 0 P; 0 
0 W; 0 Y; 0 V; 



LI 4 ANSWER 3 OF 
ACCESSION NUMBER: 
TITLE : 

INVENTOR : 

PATENT ASSIGNEE: 

(UYSC-N) 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 
PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
DESCRIPTION: 

KEYWORD: 



SEQUENCE 
SEQUENCE 



LENGTH : 



G; 
T; 
44 



S; 11 



1 catttccgat ggtgcaagcc ggtattaacc tccctcaccc ctcg 
ALIGN Smith-Waterman score: 83 
43 aa overlap starting at 5 

tcprgf avtgctcgsacgswdvraettchcqcagmdwtgarcc 



tccgatggtgcaagccggtattaacctccctca. 



cc 
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LI 4 ANSWER 4 OF 
ACCESSION NUMBER: 
TITLE: 

INVENTOR: 
PATENT ASSIGNEE: 
PATENT INFO: 
APPLICATION INFO: 
PRIORITY INFO: 

PAT. SEQ. LOC: 
DATA ENTRY DATE: 
DOCUMENT TYPE: 
LANGUAGE : 
OTHER SOURCE: 
CROSS REFERENCES: 
DESCRIPTION: 
KEYWORD : 

ORGANISM: 
ABSTRACT : 

The sequence is that of wheat germ agglutinin isolectin A, WGA-D, 
which may be expressed in transgenic plants to provide plants 
(pref. tobacco) having insecticidal and fungicidal properties in 
their leaves 

AMINO ACID COUNTS:23 A; 5 R; 11 N; 5 D; 0 B; 32 C; 12 Q; 5 E; 0 Z; 

42 G; 2 H; 3 I; 9 L; 8 K; 6 M; 6 F; 6 P; 16 S; 9 

T; 3 W; 7 Y; 3 V; 
SEQUENCE LENGTH: 213 
SEQUENCE 

1 mrkmmstmal tlgaavflaf aaataqaqrc geqgsnmecp nnlccsqygy 
51 cgmggdycgk gcqngacwts krcgsqagga tcpnnhccsq yghcgfgaey 
101 cgagcqggpc radikcgsqs ggklcpnnlc csqwgfcglg sefcgggcqs 
151 gacstdkpcg kdaggrvctn nyccskwgsc gigpgycgag cqsggcdavf 
201 agaitanstl lae 
ALIGN Smith-Waterman score: 85 

76 aa overlap starting at 13 

gslif raissiglecqsvtsrgdlatcp rgf_avtgctcgsacgswdvraett 

gaavf lafaaataqaqrcgeqgsnmecpnnlccsqygycgmggdycgkgcqngacwtskr 
chcqcagmdwtgarcc 

cgsqaggatcpnnhcc 




20 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
94P-R45359 Protein DGENE 

Transgenic plant contg. cDNA encoding wheat or barley 
lectin - has insecticidal properties in its leaves 
Raikhel N V 

(UNMS)UNIV MICHIGAN STATE 

US 5276269 A 940104 26 pp 

US 92-917665 920720 

US 89-406318 890912 . 

US 92-917665 920720 

Disclosure; Fig 6 

06 JUL 1994 (first entry) 

Patent 

English 

94-016167 [02] 

N-PSDB: 94N-Q54433 

Wheat germ agglutinin isolectin WGA-D 
Transgenic plant; leaf; leaves; insecticidal; 
fungicidal; properties; tobacco; gramineae 
Triticum aestivum L 
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L14 ANSWER 5 OF 20 DGENE COPYRIGHT 1998 DERWENT INFORMATION LTD 
ACCESSION NUMBER: 96P-W02025 Protein DGENE 

TITLE: Treatment of cellulose-contg . fabrics such as denim, 

e.g. stone :washing - using truncated cellulase enzyme 
to increase abrasion and give reduced redeposition of 
dye 

INVENTOR: Clarkson K A; Collier K D; Fowler T; Larenas E; Ward M 

PATENT ASSIGNEE: (GEMV) GENENCOR INT INC 

PATENT INFO: WO 9623928 Al 960808 . 124 pp 

APPLICATION INFO: WO 96-US977 960129 

PRIORITY INFO: US 95-382452 950201 

PAT. SEQ. LOC: Disclosure; Fig 2A-2E 

DATA ENTRY DATE: 28 OCT 1996 (first entry) 

DOCUMENT TYPE: Patent 

LANGUAGE : Engl ish 

OTHER SOURCE: 96-371466 [37] 

CROSS REFERENCES: N-PSDB: 96N-T32221 

DESCRIPTION: Trichoderma cellobiohydrolase II 

KEYWORD: Cellobiohydrolase II; CBHII; cellulase; cellulose; 

denim; stonewashing; dye redeposition; backstaining 
ORGANISM: Trichoderma longibrachiatum 

ABSTRACT: 

The amino acid sequences for Trichoderma longibrachiatum 
cellobiohydrolase I (CBHI) (W02022), CBHII (W02025) , endoglucanase 
I (EGI) (W02029), EGII (W02032) and EGIII (W02034) were deduced 
from the respective genomic DNA sequences (T32220-24) . The CBHI, 
CBHII, EGI and EGII enzymes have catalytic core domains useful for 
reducing dye redeposition (backstaining) on cellulose-contg. 
fabrics such as denim, whilst maintaining or increasing abrasion 
during stonewashing. Truncated enzymes comprising these catalytic 
core domains can be obtd. by proteolysis of the complete enzyme or 
by inserting the appropriate DNA fragment into a vector, using this 
to transform a Trichoderma sp. host cell, and recovering the 
recombinant core domain 

AMINO ACID COUNTS:60 A; 14 R; 30 N; 21 D; 0 B; 12 C; 21 Q; 10 E; 0 Z; 

40 G; 4 H; 18 I; 38 L; 10 K; 5 M; 12 F; 32 P; 47 S; 
38 T; 12 W; 20 Y; 27 V; 

SEQUENCE LENGTH: 471 

SEQUENCE 

1 mivgilttla tlatlaasvp leerqacssl wgqcggqnws gptccasgst 
51 cvysndyysq clpgaassss straasttsr vspttsrsss atpppgsttt 
101 rvppvgsgta tysgnpfvgv tpwanayyas evsslaipsl tgamataaaa 
151 vakvpsfmwl dtldktplme qtladirtan knggnyagqf widlpdrdc 
201 aalasngeys iadggvakyk nyidtirqiv veysdirtll viepdslanl 
251 vtnlgtpkca naqsayleci nyavtqlnlp nvamyldagh agwlgwpanq 
301 dpaaqlfanv yknasspral rglatnvany ngwnitspps ytqgnavyne 
351 klyihaigpl lanhgwsnaf fitdqgrsgk qptgqqqwgd wcnvigtgfg 
401 irpsantgds lldsfvwvkp ggecdgtsds saprfdshca lpdalqpapq 
451 agawfqayfv qlltnanpsf 1 



FEATURE TABLE: 

Key I Location | Qualifier I 

Peptide I1..24 I label |Sig_peptide 

Protein 1 25.. 471 I label |Mat_protein 

Domain I25..63 I label I Cellulose_binding_domain 
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Region 
Domain 



I 64. .106 | label 
1107. .47 1| label 
I I note 



I Linker_region 

|Catalytic_core_domain 

I "catalytic core domain is the 

I preferred domain for use in 

I constructs of the invention" 



ALIGN Smith-Waterman score : 82 

42 aa overlap starting at 4 
gfavtgctcgsacgswdvraettchc qcagmdwtgarcc 



gilttlatlatlaasvpleerqacsslwgqcggqnwsgptcc 
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